ABSTRACT


A principal component analysis with varimax rotation
of the principal factors was performed for watershed, storm,
and runoff data from five central and eastern Montana
watersheds. The analyses provided information about the relative
importance of 29 independent variables to the peak discharge
rates and runoff volumes produced by these variables.

Storm intensity, standard deviation of storm intensities,
soil and air temperature, watershed azimuth, overland
slope, watershed shape, reservoir area, and watershed
area were among the most successively important variables.
Correlations among some of the variables for the research
watersheds were also indicated by the analyses. Principal
component and rotated-factor regression equations for the
runoff variables were developed, and are suggested as
prediction equations for ungaged watersheds in central and
eastern Montana.


Chapter  I 
INTRODUCTION 

In  order  to  make  reasonable  predictions  of  peak  dis- 
charge rates  and  total  runoff  volumes  on  small  watersheds, 
an  understanding  of  the  factors  causing  the  runoff  is  need- 
ed.    If  the  important  watershed  and  storm  variables  could 
be  properly  identified  and  measured,  the  relationships  be- 
tween these  variables  and  the  runoff  could  then  be  readily 
determined. 

HISTORICAL BACKGROUND

Many attempts have been made to relate peak discharge
rates and total runoff volumes from a watershed to their
causative factors. The relationships which have been proposed
usually take the form of graphs or equations relating
the rates and volumes to factors that are believed to be
important. The general trend in these methods is to choose
the "important" factors, obtain measurements of each, and
then relate the factors to the rates and volumes produced.
However, the choice of factors is usually made from experience
or judgment, and the proper or improper choice of factors
leads to accurate or inaccurate results. A comparison
of the methods indicates that there is considerable confusion
as to which factors are to be used, and which
factors are more important than others. After analyzing
several methods of discharge rate and runoff volume prediction,
Sharp and Biswas (1965) wrote: "Exhaustive analysis
of research data from small watersheds not only failed
to reveal how various factors function in producing
runoff, but failed even to reveal the parameters that
should be used to estimate runoff." The wide range of
factors used in prediction methods seems to substantiate
this. Some factors appear in more of the methods than do
other factors, but none of the methods agree on which set
of factors are the most important.

PURPOSE AND METHODS

The study reported herein is an attempt to determine,
for five central and eastern Montana watersheds, which of
29 factors are more important in producing peak discharge
rates and total runoff volumes, and the regression equations
relating peak discharge rates and runoff volumes to
these factors. Multivariate statistical analyses are used
to investigate the relative importance of each of the independent
variables to the two dependent variables, peak discharge
rate and total runoff volume. As explained in a
later chapter, a principal component analysis of the correlation
matrix of the independent variables is performed,
and this is followed by a varimax rotation of the factor
weight matrix to determine which variables are most important.
Regression equations for the dependent variables
are found from the important variables, using the
uncorrelated principal components and the rotated factor
weight matrix.

Although only a few investigators have applied multivariate
methods to hydrologic data, several have recommended
their use in this field. The advantages which are presented
in a later chapter indicate that they provide a
significant improvement over some of the other methods
used. The methods are employed here because they are well
suited to the large volumes of data that have been generated
on the watersheds being investigated. Because of the
large amounts of data involved, multivariate analyses were
not practical until the advent of the electronic, digital
computer. Computer techniques were used extensively in the
data reduction and analyses for this study.

SCOPE

Data for this study was taken from five small central
and eastern Montana watersheds currently being studied for
the Drainage Correlation Research Project, by the Department
of Civil Engineering and Engineering Mechanics at
Montana State University. The Drainage Correlation Research
Project, which was initiated in 1963, is sponsored
by the Montana State Highway Commission and the Bureau of
Public Roads, and is an investigation of the frequency of
peak discharge rates for small watersheds in Montana.
Data collection is expected to continue until September,
1969. Fifty runoff events on the five project watersheds,
having peak discharge rates greater than 10 cfs, and occurring
between April, 1964, and September, 1967, were
studied for the investigation reported herein.


DEFINITIONS 

Because certain terms and phrases are frequently used
in this paper, several definitions are presented here.
The definitions are those commonly used in the literature,
and some are more thoroughly discussed in later chapters.

Dependent variable - the variable to be predicted from
measurements of the independent variable(s) in a regression
equation (e.g., peak discharge rate).

Independent variables - the variables on which measurements
are obtained and substituted into the regression
equation to calculate the prediction of the
dependent variable (e.g., precipitation intensity,
watershed area, etc.).

Regression equation - an equation for the dependent
variable, derived from several measurements of this
variable and the independent variable, or variables,
in a manner which indicates the relationship of the
independent variables to the dependent variable. (If
more than one independent variable is involved, the
equation is usually termed a "multiple regression
equation.")

Multiple linear regression equation - a multiple regression
equation in which the dependent variable is
related to a sum of the independent variables, with
each of the independent variables being multiplied
by a different coefficient.

Coefficient of a variable - a constant to be multiplied
by the measurement of a variable in a regression
equation, component, or factor.

Multivariate studies - methods of studying more than
two variables when the measurements of the variables
are obtained simultaneously in time or space.

Linear correlation of two variables - a statistical
measure of the closeness to a straight line of the
graphical plot of measurements of both variables.

Components - Principal Components - Normalized Eigenvectors -
derived, uncorrelated, independent variables
written as the sums of the original independent variables,
if each is multiplied by a coefficient.

Factor - a component whose coefficients on the independent
variables have been multiplied by a constant.
The squared coefficients within a factor total to the
square of the constant.

Rotated factor - a factor whose large coefficients on
the independent variables have been maximized.

Variate  -  a  component,  factor,  or  rotated  factor. 
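
The definitions of linear correlation, component, and factor can be illustrated with a minimal sketch for the simplest case of two variables. The measurements below are invented for illustration, the names `storm_intensity` and `storm_depth` are hypothetical, and the scaling constant used for the factor (the square root of the eigenvalue) is an assumption based on common practice rather than something stated in the definitions above. For two standardized variables with correlation r, the correlation matrix has normalized eigenvectors (1, 1)/sqrt(2) and (1, -1)/sqrt(2), with eigenvalues 1 + r and 1 - r.

```python
import math

def standardize(values):
    """Return z-scores: zero mean and unit (population) variance."""
    m = sum(values) / len(values)
    sd = math.sqrt(sum((v - m) ** 2 for v in values) / len(values))
    return [(v - m) / sd for v in values]

def correlation(xs, ys):
    """Linear correlation: the mean product of the z-scores."""
    zx, zy = standardize(xs), standardize(ys)
    return sum(a * b for a, b in zip(zx, zy)) / len(xs)

def two_variable_components(xs, ys):
    """Principal components of the 2x2 correlation matrix [[1, r], [r, 1]]."""
    r = correlation(xs, ys)
    zx, zy = standardize(xs), standardize(ys)
    k = 1.0 / math.sqrt(2.0)  # coefficient of each normalized eigenvector
    comp1 = [k * (a + b) for a, b in zip(zx, zy)]
    comp2 = [k * (a - b) for a, b in zip(zx, zy)]
    return comp1, comp2, (1.0 + r, 1.0 - r)

# Hypothetical measurements, invented for illustration only.
storm_intensity = [0.5, 1.2, 0.8, 2.0, 1.5, 0.3]
storm_depth = [0.4, 1.0, 0.9, 1.7, 1.6, 0.2]

r = correlation(storm_intensity, storm_depth)
comp1, comp2, eigenvalues = two_variable_components(storm_intensity, storm_depth)

# The derived components are uncorrelated: their mean cross product is zero.
cross = sum(a * b for a, b in zip(comp1, comp2)) / len(comp1)

# A factor rescales a component's coefficients by a constant
# (assumed here to be the square root of the eigenvalue).
constant = math.sqrt(eigenvalues[0])
factor1_coefficients = [constant / math.sqrt(2.0), constant / math.sqrt(2.0)]
sum_of_squares = sum(c * c for c in factor1_coefficients)  # equals constant ** 2
```

The two original variables are correlated, but the two derived components are not, and the squared coefficients of the factor total to the square of the constant, as the definition of a factor states.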


Chapter  II 
LITERATURE REVIEW

In recent years a great many investigators have approached
the problem of peak discharge frequencies from
small watersheds, and have applied a great variety of
techniques in analyzing the problem. Many of the investigators
have used statistical methods, and several have
applied multivariate analyses to hydrologic data. Reports
of a large number of these investigations were reviewed in
the course of the study reported herein. To discuss all
the literature which was reviewed would result in an extremely
voluminous and unwieldy report. It would seem more
appropriate, therefore, to confine the review herein to the
results of a few of the more relevant studies which bear
directly on the present investigation.

INVESTIGATIONS WITH MULTIVARIATE ANALYSES

Prior to 1950, multivariate methods were well established,
but were not practical in hydrology because of the
time-consuming computations. However, as the high-speed
computer became more accessible to investigators, the methods
were recognized as a possible means of studying the
peak discharge rate and runoff volume prediction problem.
For example, Wong (1963) used multivariate methods to
analyze data from 90 basins in New England. The basins
ranged in area from 10 to 2,000 square miles, and measurements
of eleven independent variables were taken on each
of these basins to determine a regression equation for the
mean annual flood with a recurrence interval of 2.33 years.

Wong found that a previous "ordinary" multiple regression
on five of the eleven independent variables was
not satisfactory because the variables, average land slope,
mean altitude, tributary channel slope, stream density,
and shape of basin were, according to Horton's Laws, "multicollinear,"
meaning that some were linearly related to
others and should not be included. After obtaining measurements
on six additional variables, drainage area, main
channel slope, tributary channel slope, percentage of area
in ponds and lakes, length of longest watercourse, and precipitation
intensity, a principal component analysis and
varimax rotation were performed on the data. This resulted
in two new, unrelated variables or "components." Both of
these components were linear functions of all the measured
variables, but the new components were not linearly
related to each other. Also, the first component was
found to be significantly more important than the second,
and each component was found to be more highly associated
with certain variables. The first had large coefficients
on variables which expressed the area and length of the
drainage basin, and the second was associated with the slope
and topography of the drainage basin. Both components were
about equally associated with mean annual flood, indicating
that both were important to the dependent variable.

Because the two components indicated that size and
length were not related to slope and topography, Wong decided
that these two parameters would form a good set of
independent variables for a multiple regression equation.
He examined the correlations of mean annual flood with the
components and the eleven independent variables, and found
that the length of the main stream, L, was highly correlated
with both the mean annual flood and the first component;
and that the average land slope, S, was similarly
related to the second component and mean annual flood.
These two variables were consequently chosen for a multiple
regression, giving:

     Log Q2.33 = -1.02 + 1.29 Log L + 0.97 Log S

for the regression equation. This equation had a coefficient
of determination of 0.80, which meant that 80 per cent of
the variation in the mean annual flood could be explained
by only two variables instead of eleven. The previous
regression on five independent variables had the same coefficient
of determination, but involved linearly related
variables. The study, therefore, reduced the number of
"independent" variables needed to explain the same variation
in mean annual flood for New England.
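
As a minimal sketch of how the two-variable equation attributed to Wong would be evaluated, the function below computes the mean annual flood from the main stream length L and the average land slope S. Two assumptions are made that this section does not confirm: that "Log" denotes the base-10 logarithm, and that the inputs are supplied in whatever units Wong used (the units are not stated here). The function name and the sample inputs are hypothetical.

```python
import math

def mean_annual_flood(main_stream_length, avg_land_slope):
    """Evaluate Log Q2.33 = -1.02 + 1.29 Log L + 0.97 Log S.

    Assumes base-10 logarithms; units must match those of the
    original study, which are not given in this section.
    """
    log_q = (-1.02
             + 1.29 * math.log10(main_stream_length)
             + 0.97 * math.log10(avg_land_slope))
    return 10.0 ** log_q

# Hypothetical inputs, chosen only to show the shape of the relationship.
q_base = mean_annual_flood(10.0, 5.0)
q_double_length = mean_annual_flood(20.0, 5.0)

# Doubling L multiplies the predicted flood by 2 ** 1.29, whatever S is.
ratio = q_double_length / q_base
```

Because the equation is linear in the logarithms, it is a power law in the original variables: the predicted flood is proportional to L raised to 1.29 and S raised to 0.97.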

Eiselstein (1967) performed a similar analysis, although
he was interested in runoff volume instead of mean
annual flood rate. A J^O-acre watershed was divided into
17 runoff plots on which data from 30 variables was obtained
over a period of four years. The variables were grouped
into five categories, storm variables, antecedent moisture
variables, site variables, soil description variables, and
the dependent variable, runoff in inches from each plot.

Because Eiselstein was aware of the inadequacy of
ordinary multiple regression to provide a good prediction
equation when the independent variables are not truly independent,
he performed three separate analyses to show
the different results that can be obtained. An ordinary
linear regression analysis gave an equation in terms of
all 29 "independent" variables, and 13 regression coefficients
were found to be statistically significant, i.e.,
non-zero. This equation accounted for 77 per cent of
the variation in the runoff volume, but Eiselstein was
not satisfied with the results because the non-significant
variables had high coefficients of correlation with
each other and with the significant variables. Also, the
test for significance was not valid because correlated
variables were used. This meant that the "significant"
variables were a combined measure of several variables, and
the "non-significant" variables could not honestly be discarded.

To attempt to separate the combined effects of
several variables into the effect of each, a principal component
analysis of the correlation coefficients of all
combinations of the independent variables was performed.
This resulted in 29 new, independent variables, or "components,"
which were each linear functions of all the 29
original "independent" variables. The first of these
components had high coefficients in the rainfall variables,
but the remaining components could not be readily associated
with specific variables. This is the reason
for the varimax rotation, which rotates the components to
another set of reference axes so that only high or low
coefficients exist on the original variables, and a
better interpretation can be made.
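
The idea of rotating to axes on which coefficients are either high or low can be sketched for the simplest case of two factors. The sketch below is not Eiselstein's procedure: it uses the raw varimax criterion (the variance of the squared coefficients, summed over factors, without the row normalization that is often applied) and a plain grid search over the rotation angle instead of the usual iterative solution. The coefficient table `loadings` is invented for illustration.

```python
import math

def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared coefficients."""
    p = len(loadings)
    total = 0.0
    for j in range(len(loadings[0])):
        squares = [row[j] ** 2 for row in loadings]
        mean_sq = sum(squares) / p
        total += sum((s - mean_sq) ** 2 for s in squares) / p
    return total

def rotate(loadings, theta):
    """Orthogonal rotation of a two-column coefficient table by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    return [[a * c + b * s, -a * s + b * c] for a, b in loadings]

def varimax_two_factors(loadings, steps=9000):
    """Grid search over [0, pi/2) for the angle maximizing the criterion."""
    best_theta, best_value = 0.0, varimax_criterion(loadings)
    for i in range(1, steps):
        theta = (math.pi / 2.0) * i / steps
        value = varimax_criterion(rotate(loadings, theta))
        if value > best_value:
            best_theta, best_value = theta, value
    return rotate(loadings, best_theta), best_theta

# Hypothetical unrotated coefficients for four variables on two factors.
loadings = [[0.70, 0.55], [0.65, 0.50], [0.60, -0.65], [0.55, -0.60]]

rotated, theta = varimax_two_factors(loadings)
before = varimax_criterion(loadings)
after = varimax_criterion(rotated)
```

An orthogonal rotation leaves the sum of squared coefficients of each variable unchanged, so the rotated table carries the same information as the unrotated one; only the reference axes differ.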

Before rotating the "principal components," Eiselstein
computed values for each component by using the original
variable data to solve the linear equations. This resulted
in a numerical value of each component for each
runoff event. Because the components were truly independent,
a multiple regression on these values was performed. After
the regression coefficient for each component was found, 18
of the 29 coefficients exhibited significance. Also, the
regression equation explained 77 per cent of the variation,
which was exactly the same amount explained by the ordinary
regression equation. This analysis gave a good prediction
equation because truly independent variables were used.
However, because each significant component was a linear
function of all 29 original variables, nothing could be
stated about the importance of the original variables at
this point in the analysis.
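
A regression on component values can be sketched for two correlated predictors, where the components have a closed form. The sketch is an illustration under stated assumptions, not a reconstruction of Eiselstein's computation: the data are invented, the predictors are standardized, and the components are the sum and difference of the standardized predictors divided by sqrt(2). Because the components are uncorrelated, each regression coefficient can be computed separately, and the component equation can then be rewritten in terms of the standardized predictors.

```python
import math

def standardize(values):
    """Return z-scores: zero mean and unit (population) variance."""
    m = sum(values) / len(values)
    sd = math.sqrt(sum((v - m) ** 2 for v in values) / len(values))
    return [(v - m) / sd for v in values]

def component_regression(x1, x2, y, keep_second=True):
    """Regress y on the principal components of two predictors.

    Returns the coefficients on the standardized predictors and the
    coefficient of determination of the resulting equation.
    """
    z1, z2 = standardize(x1), standardize(x2)
    y_mean = sum(y) / len(y)
    yc = [v - y_mean for v in y]
    k = 1.0 / math.sqrt(2.0)
    comp1 = [k * (a + b) for a, b in zip(z1, z2)]
    comp2 = [k * (a - b) for a, b in zip(z1, z2)]
    # Uncorrelated components: each coefficient is found on its own.
    b1 = sum(c * v for c, v in zip(comp1, yc)) / sum(c * c for c in comp1)
    b2 = 0.0
    if keep_second:
        b2 = sum(c * v for c, v in zip(comp2, yc)) / sum(c * c for c in comp2)
    # Rewrite the component equation in terms of the predictors.
    z_coefs = (k * (b1 + b2), k * (b1 - b2))
    fitted = [y_mean + z_coefs[0] * a + z_coefs[1] * b for a, b in zip(z1, z2)]
    ss_res = sum((v - f) ** 2 for v, f in zip(y, fitted))
    ss_tot = sum(v * v for v in yc)
    return z_coefs, 1.0 - ss_res / ss_tot

# Hypothetical data: two strongly correlated predictors and a response.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [1.2, 1.9, 3.2, 3.8, 5.1, 6.3]
y = [2.1, 3.9, 6.2, 8.1, 9.8, 12.2]

coefs_full, r2_full = component_regression(x1, x2, y)
coefs_one, r2_one = component_regression(x1, x2, y, keep_second=False)
```

With every component retained, the fit explains the same variation as an ordinary regression on the original predictors, which matches the identical 77 per cent figures reported above. Dropping a component can only lower the coefficient of determination, and with only the first component retained both predictors receive the same coefficient.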

To further investigate the separate variables,
Eiselstein's third analysis consisted of a multiple regression
on rotated components instead of the original
components. An initial, orthogonal varimax rotation of
the original components yielded a set of components which
all had low coefficients on 12 of the original variables.
Because these variables were not significant to any of the
rotated components, they were deemed unimportant to the
runoff and discarded. Also, only the first seven components
were deemed to be important because the rest each
contributed less than one per cent to the variation of the
dependent variable. This resulted in seven linear equations
for 17 of the original variables, giving essentially the
same information as the first 29 components and variables.

A second varimax rotation of the seven components
gave seven new components which could be interpreted in
terms of the 17 remaining independent variables. One of
the components accounted for 51 per cent of the variation
in runoff, and had high coefficients on precipitation intensity
and total precipitation. The second most important
component accounted for 5 per cent of the variation
in runoff and had high coefficients on slope, elevation,
and "aspect." The third component, accounting for four
per cent of the variation in runoff, had high coefficients
on surface soil properties. The other components each
explained only a small portion of the variation in runoff,
and could not be associated with specific variables. The
important interpretation from the components was that the
rainfall variables were much more important to runoff than
the aspect and soil properties, because they accounted for
51 per cent of the variation in runoff. Also, the variables
within each component were linearly dependent and their
combined effect was independent of the combined effect of
the variables of other components. This meant that a
multiple regression on the components, or on certain variables
from each component, would be a regression on truly
independent variables. (Wong had made this same observation
in 1963 and had written his final regression equation
in terms of two nearly independent variables, one from
each important rotated component. Other variables were
significant to the components, but the two he chose were
combined measures of those in each component.)

Eiselstein chose to find a multiple regression equation
for all 17 of the variables in the seven components,
rather than select one variable from each component to represent
the total component. After substituting data from
the original variables into the component equations, a multiple
regression of the values computed gave an equation
for runoff in terms of the components, and hence in terms
of the 17 independent variables, because the components
were linear functions of the variables. The coefficients
of this final equation seemed to be realistic because no
obvious fallacies in the signs of the coefficients could
be detected. For example, the rainfall characteristics
were directly and not inversely related to runoff as is
the case in some ordinary multiple regression equations
(Sharp, Gibbs, Owen, and Harris, 1960; Wallis, 1965).
The final coefficient of determination was 0.67, indicating
a 10 per cent loss of information as a result of reducing
29 variables to 17, and 29 components to 7.

Snyder (1962) performed a principal component regression
analysis relating total December runoff from a watershed to
the rainfall in October, November, and December. A previous,
ordinary multiple regression equation had negative coefficients
on all the independent variables, indicating that runoff
increased as the monthly rainfall decreased. Because
this was "intuitively" inaccurate, Snyder obtained three
independent components from a principal component analysis,
and derived a regression equation for these components, and
hence for the variables, and found that all coefficients
were positive. A reduction in the coefficients of determination
from 0.83 to 0.75 for the principal components
solution resulted, but Snyder felt that the loss in information
about the variation in runoff was justified by the
intuitively correct coefficients. This study was used to
illustrate the possibilities for multivariate analyses in
hydrologic studies, and no rotation of the principal components
was made, because the component regression equation
was deemed to be satisfactory.

The three above investigations used essentially the
same techniques. In all of them, the ordinary multiple
regression solution was unsatisfactory, and multivariate
techniques were used to overcome this difficulty. Each
used a principal component analysis to obtain truly independent
variables for the regression. Also, rotation
of these components was used to provide information on
the importance of certain variables in two of the papers.

RECOMMENDATIONS OF THE LITERATURE

The previous section discussed some of the applications
of multivariate analyses. A short presentation of
several recommendations found in the literature follows.
The purpose here is to present the opinions of several
authorities on the use of multivariate methods with hydrologic
data.

Eiselstein's investigation was undertaken as a result
of a paper by Wallis (1965), which suggested the advantages
of multivariate methods over ordinary multiple
regression for hydrologic studies. Wallis compared the presently
used methods of obtaining an equation for the dependent
variable in terms of several independent variables,
and made several recommendations. First, he suggested that
a linear-logarithmic transformation model, similar to Wong's,
be used with hydrologic data. This should be followed by
a principal component regression analysis with varimax
rotation of the principal components for an initial analysis
of "multifactor hydrologic problems." These recommendations
were made after a study of the adequacy of the methods
in obtaining a known functional relationship, the weight of
a solid cylinder in terms of its density and dimensions.
Wallis (1968) presented several other suggestions for
the best utilization of multivariate statistical methods in
hydrologic studies. First he suggests that no more than two
variables important to any factor in the rotated factor table
be retained for further study. After selecting the retained
variables, Wallis suggests that a complete principal component
analysis with varimax rotation of the principal factors
be performed on the original measurements of only these
variables. A regression on these factors is then suggested
for the prediction equation.

After his analysis of 90 New England basins, Wong
(1963) stated that, "multivariate methods should be more
widely encouraged in geomorphic and hydrologic research"
(p. 198). Eiselstein recommended that, "a principal component
analysis with varimax rotation of the factor weight
matrix is a suitable statistical technique for the correlation
of small watershed surface characteristics with surface
runoff" (p. ^8^). Although Snyder did not use or recommend
varimax rotation, he did state that principal component
regression analyses give "logical equations" for runoff
when compared to ordinary multiple regression techniques.

METHODS APPLIED TO MONTANA WATERSHEDS

Multivariate methods have not been previously applied
to small watersheds in central and eastern Montana. Boner
(1963) presented a report of his study of the frequency and
magnitude of floods in eastern Montana for the United States
Geological Survey. This report presents an ordinary multiple
regression equation for mean annual flood in terms of
area, stream meander length, geographical region, and elevation
of small watersheds in eastern Montana. The dependent
variable was found to be directly proportional to the first
three variables, and inversely related to elevation. Also,
the flood having a recurrence interval, I, is obtained by
multiplying the mean annual flood by a factor from a "composite
flood frequency curve" for that interval. Variables
other than those listed were not used because measurements
were not available.

Boner and Omang (1967) presented a report on the
magnitude and frequency of floods from watersheds smaller
than 100 square miles in area in Montana. This report gives
a method of obtaining the floods with recurrence intervals
of 10 and 25 years for this region. The 10-year flood is
found from a regression equation involving the area, elevation,
channel slope, and mean annual runoff of a watershed,
and the 25-year flood is obtained from the 10-year flood
upon multiplication by an empirically determined constant.

Both of the above studies employed ordinary multiple
regression techniques with only four easily determined
independent variables.

The studies of Wong, Eiselstein, and Snyder used more
variables than four, but the developed equations could not
reasonably be used in Montana. However, the methods should
be applicable to any region. None of the literature reviewed
indicates that attempts have been made to determine
equations for both the peak discharge rate and the runoff
volume in a single analysis.


Chapter III
THEORETICAL  DEVELOPMENT 

In this chapter, a survey of the available methods
of analyzing runoff from small watersheds using measurements
of several related variables is presented, followed
by a discussion of the reasons the specific methods for
this analysis were chosen. A complete theoretical development
of the methods chosen concludes the chapter.

POSSIBLE METHODS OF ANALYSIS

Ordinary multiple linear regression is one of the few
statistical methods of simultaneously analyzing several
variables to estimate one or more of them. The analysis
is simply an analytical method of plotting a line, plane,
or hyperplane through a multitude of data points. In
general, a linear equation with an unknown intercept and
slope is assumed, and the intercept and slope are calculated
from the data in a manner which minimizes the squared
distances from the points to the line, plane, or hyperplane.
Until recently, ordinary multiple linear regression of the
logarithms of the variables has probably had the most use
in predicting runoff from small watersheds (Sharp, et al.,
1960).
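
A minimal sketch of ordinary multiple linear regression is given below, under the usual least-squares assumption that the squared distances being minimized are measured in the direction of the dependent variable. The intercept and slopes are found by forming and solving the normal equations with Gaussian elimination. The function names and the data are hypothetical; the data were generated exactly from y = 1 + 2 x1 - 0.5 x2 so that the fitted coefficients can be checked by eye.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def fit_linear(rows, y):
    """Return [intercept, slope_1, ..., slope_k] minimizing squared residuals."""
    design = [[1.0] + list(r) for r in rows]  # leading 1 carries the intercept
    p = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * v for d, v in zip(design, y)) for i in range(p)]
    return solve(xtx, xty)

# Hypothetical measurements of two independent variables per event.
rows = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0), (1.0, 3.0)]
y = [1.0 + 2.0 * x1 - 0.5 * x2 for x1, x2 in rows]

coefficients = fit_linear(rows, y)
```

To fit the logarithmic form mentioned above, the same function would be applied to the logarithms of the measurements rather than to the measurements themselves.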

The greatest objection to the use of multiple regression
analyses with hydrologic data is that certain
basic assumptions of multiple regression theory are violated.
Multiple regression assumes that there are no correlations
among the independent or "predictor" variables
(Thurstone, 1947). Hydrologic variables, as shown by
Wong, usually violate this assumption. If ordinary multiple
regression is attempted, the coefficients of the multiple
regression equation for the dependent variable have
sometimes been found to be "absurd" and "grossly in error"
(Thurstone, 1947, p. 61).

Many regression equations for runoff have been derived
using many combinations of independent variables.
Snyder (1962) studied some of these equations and concluded
that multiple regression analyses do not yield "logical
equations" when used in hydrology. In these cases, he was
referring to the coefficients associated with each of the
independent variables and not necessarily with the accuracy
of the prediction equation. A presentation of the mechanics
of multiple linear regression was given by DuBois
(1957), and Baggaley (1964).

Another statistical approach to the problem of investigating
several variables was developed by Harris,
et al. (1961). Their purpose was to present a method of
selecting the most important independent variables, and
to use only these variables in the regression equation.
A Taylor series expansion was used to determine successively
the most important variables by initially removing
the effects of the other variables. The method is an
analytic approach to "graphical curvilinear multiple regression,"
and eliminates the usual "shotgun" search for
important variables. The method provides a statistical
means of determining the important variables, but because
hydrologic variables are generally correlated, as discussed
earlier, attempts at multiple regression are often not
successful.

Wong (1963) discusses several other possible methods
of analyzing runoff in terms of several independent
variables. One of these, "stepwise multiple regression," is
similar to Harris' analysis because the variable which
contributes most to the variation in the dependent
variable is determined. Its effects are then removed, and
the second most important variable is found. This process
is repeated until enough variables are found to account
for all or most of the variation in the dependent
variable, and a multiple regression is performed on these
variables. Again, the relative importance of the
independent variables is indicated, but the multiple
regression assumption may be violated. Ralston (1960)
presents a development of stepwise multiple regression, and
outlines the procedures for programming the method for
digital computers.
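
The selection loop described above can be sketched as follows.
This is a simplified illustration, not the procedure of Wong or
Ralston: each step regresses the current residual on a single
predictor, which agrees with a full stepwise regression only
when the predictors are uncorrelated. The function name
forward_select, the 0.95 target, and the data are hypothetical.

```python
def forward_select(columns, y, target=0.95):
    """Pick predictors one at a time by the variation each removes from y.

    Returns (order, slopes, explained_fraction).
    """
    n = len(y)
    y_mean = sum(y) / n
    residual = [v - y_mean for v in y]
    centered = []
    for col in columns:
        m = sum(col) / n
        centered.append([v - m for v in col])
    total_ss = sum(r * r for r in residual)
    order, slopes = [], []
    explained = 0.0
    while explained < target and len(order) < len(columns):
        best, best_gain, best_slope = None, 0.0, 0.0
        for j, col in enumerate(centered):
            if j in order:
                continue
            sxx = sum(v * v for v in col)
            if sxx == 0.0:
                continue
            sxr = sum(v * r for v, r in zip(col, residual))
            gain = sxr * sxr / sxx  # sum of squares removed by predictor j
            if gain > best_gain:
                best, best_gain, best_slope = j, gain, sxr / sxx
        if best is None:
            break  # no remaining predictor reduces the residual
        residual = [r - best_slope * v
                    for r, v in zip(residual, centered[best])]
        order.append(best)
        slopes.append(best_slope)
        explained = 1.0 - sum(r * r for r in residual) / total_ss
    return order, slopes, explained


# Hypothetical, mutually uncorrelated predictors.
xa = [1.0, -1.0, 1.0, -1.0]
xb = [1.0, 1.0, -1.0, -1.0]
xc = [1.0, -1.0, -1.0, 1.0]
y = [5.0 * b + 2.0 * a for a, b in zip(xa, xb)]
print(forward_select([xa, xb, xc], y))
```

With correlated predictors, the usual practice is to refit all
selected variables jointly at every step, and the caution about
the regression assumption still applies.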

Another method of handling this problem is by using
multivariate statistical analyses. Several of these are
outlined in the next section, along with the advantages
and disadvantages of each. The methods chosen for use in
this investigation are indicated, and the reasons for the
selection are outlined.

MULTIVARIATE STATISTICS

One commonly used multivariate statistical analysis
is known as "component analysis." There are two varieties
of component analysis, known as "principal component
analysis" and "centroid analysis." Both are mathematical
means of obtaining new variables or "components" from the
intercorrelations of the chosen independent variables. The
objective in the analysis is that the components obtained
will (1) be fewer than the number of independent variables
under study, (2) represent or reproduce the original
variables, (3) account for all the variation in the
original variables, and (4) be uncorrelated even if the
original variables were highly correlated with each other.
Principal component analysis extracts the new variables, or
"variates," one by one. The first component is extracted in
such a manner that it reproduces a maximum amount of the
information in the original data. The second component
accounts for a maximum amount of the information remaining
after the extraction of the first component. This process
is repeated until all the components have been extracted,
and all the original information in the data is reproduced
by the components.

Centroid analysis has been termed a "simplified
approximation of the principal components solution" (Cooley
and Lohnes, 1962, p. 153), and is used to avoid the involved
calculations of a principal component analysis, namely, the
solution of the "characteristic equation," defined later.
Kendall (1957) gives an example of the "approximate"
centroid solution compared to the principal component
solution of the same problem, and shows the different
solutions obtained. When computer availability obviates
most concern for involved computations, the principal
component analysis is the better method.

"Factor analysis" is a multivariate method similar to
component analysis, but differing in the general method of
approaching a problem. A factor is defined as a linear
equation in terms of all or some of the original variables,
but is not the same as a component. Kendall (1957) states
that component analyses attempt to proceed from the data to
a model, while factor analyses begin with a model and
investigate its agreement with the results. In factor
analyses, an investigator examines the data and speculates
on how many factors or groupings of variables might be
present. Guilford (1952), in a discussion on when to factor
analyze, illustrated this point by writing: "The initial
planning should emphasize the formation of hypotheses as to
what factors are likely to be found in the selected domain
and to the probable properties of such factors" (p. 36). For
example, if the relationship of peak discharge rate to
several variables such as soil moisture, air temperature,
wind speed, watershed area, storm duration, stream channel
lengths, excess precipitation intensity, soil permeability,
etc., is desired, then the variables might initially be
viewed as being composed of two factors or variable
"groupings," one of climate and one of watershed
characteristics. The results of the factor analysis are two
or more factors, and an examination of the loadings of the
factors will reveal if the two-factor assumption was
correct. In hydrology, factors might easily be formulated,
although none of the reviewed papers employed this method.
Eiselstein (1967) grouped the original independent
variables into four categories, but he did not initially
state that he expected a factor for each category. His
first principal component could have been termed a
"rainfall" factor, but the remaining components were not
readily associated with the categories. His purpose, that
of principal component analyses, was to derive new, truly
independent variables for a multiple linear regression.
Components are independent, and the combined effect of the
variables within each component is independent of the
effect of other components. This means that the important
variables within a component are not necessarily from the
same category, as is hopefully the case with factor
analyses. No initial assumptions about the outcome are made
with component analyses. The data are analyzed only for
uncorrelated components, and the model differs from that of
a factor analysis in the above manner. If factors result,
they are coincidental, but are desirable because they give
information about the importance of groups of variables.

Neither a factor analysis nor a component analysis
provides information about the importance of each
independent variable. The coefficients on the variables are
correlations of the variables with the factors or
components, but they generally are relatively large in
magnitude for all the variables. If one or more variables
in a component analysis have a small correlation with all
the components, then those variables are not important to
any of the components, and hence to the total problem.
Because the first component is found in a manner which
yields high correlations with all the variables, no
interpretations can be made, and the components are useful
only when regression of uncorrelated variables is desired.

Because a principal component analysis does not usually
provide information about the importance of each
independent variable, another multivariate method,
"rotation of the principal components," is in general use.
The components can be rotated so that each component has
high coefficients on certain variables and low coefficients
on other variables, allowing interpretations for the
important variables, and obtaining the "simple structure"
of the components (Matalas, 1967). Graphical rotation
presentations are given by Fruchter (1954) and Baggaley
(1964). Both authors use two-dimensional plots which show
the measurements of the variables as points, and rotated
components are simply lines drawn through clusters or
"streaks" of points so as to maximize zero-loadings on the
components. However, the graphical solutions are
approximate, and different investigators might obtain
different results with the same data. Kaiser (1958, 1959)
derived an analytic rotation which guarantees the same
results for different investigators. This method is known
as "varimax rotation," and has been utilized in the
literature (Rice, 1967; Wallis, 1965; Wong, 1967;
Eiselstein, 1967).

Rotation of the variates from either a component or
factor analysis has no effect on the amount of information
retained by the variates. Only the interpretation of the
loadings is affected (Cooley and Lohnes, 1962). Kaiser's
"normal" varimax rotation not only minimizes the
"in-between" loadings, but it also maintains an orthogonal
or perpendicular reference frame (Wallis, 1965; Kaiser,
1958). This feature is desirable if a multiple regression
on the rotated components is to be performed, because
perpendicularity of the components means that they are
uncorrelated, and the assumptions of multiple regression
are not violated. "Oblique," or non-perpendicular,
analytical rotations such as the "Quartimin," "Oblimin," or
"Covarimin" have been developed, but do not appear
satisfactory (Cooley and Lohnes, 1962).

Multivariate methods are advantageous in several
aspects. Ordinary multiple regression provides good
prediction results, but gives no insight into the
interrelationships of the variables. Multivariate methods
obviate the effects of highly correlated "independent"
variables, while ordinary multiple regression analyses do
not. Interpretations of the results of multivariate
analyses allow the exclusion of unimportant variables and
the recognition of the more important variables.

Multivariate methods also allow the reduction of the
number of variates for multiple regression. Component
analyses produce exactly as many new variates as there are
independent variables. However, since the extraction is
done on a "priority" basis, some of the latter variates
may reproduce only a small portion of the information,
and may be excluded. Wong (1967), Eiselstein (1967), and
Rice (1967) were all able to considerably reduce the number
of variates needed to reproduce almost all of the
information present.

Besides allowing the interpretation of the importance
of each variable to the original data, the orthogonality
of the new variates is a principal achievement. Because
the data can be expressed by uncorrelated variates,
multiple regression assumptions are not violated if the
regression is performed on these variates. The new
independent variates reproduce the original data and are
truly uncorrelated. Because of this, the correlations among
the independent variables are indicated with multivariate
methods by producing uncorrelated variates.

Still another advantage of multivariate methods with
correlated variables is the improvement in the final
regression equation coefficients. Snyder (1962) stated that
multivariate methods yield "nice" coefficients which
indicate the relative importance of each independent
variable to the criterion. Wallis (1965) demonstrates that
principal component regression coefficients tend to be
"stable" when compared to ordinary regression coefficients.
In a discussion of Rice's paper, Anderson (1967) stated that
the "coefficients remained very distinctly realistic with
regression on principal components" (p. 6). This, and the
other advantages of multivariate methods, seem to provide
justification for their use in hydrologic studies.

Investigators of the many multivariate methods agree
that the best statistical system of analyzing hydrologic
data would start with a principal component extraction,
followed by a varimax rotation for interpretation of the
variables, and a multiple regression on either the
principal components or the rotated factors for the
prediction equation (Eiselstein, 1967; Wallis, 1965; Wong,
1967; Anderson, 1967). Baggaley (1964), however, states that
good rotation results are not likely unless more than 20
variables are involved. Because 29 independent variables
were available for the analysis reported herein,
multivariate methods were indicated for the present study.

The methods presented in this section were developed
by Kendall (1957) and Kaiser (1958), and most of their
equations and derivations are repeated herein. Graphical
interpretations of the methods are included in the Appendix.
Some of the steps in Kendall's and Kaiser's derivations
were omitted in their reports, and are included in the
present development because of their importance.

Model 

In all studies which involve regression as a final
analysis, a model for the regression must be assumed. The
variables may be powered, multiplied, divided, or added.
In general, multiple regression analyses provide the
coefficients for linearly additive independent variables.
This equation usually has the form

    Y = B_0 X_0 + B_1 X_1 + \cdots + B_p X_p                (1)

where Y is the dependent variable or criterion, X_i is
the i-th independent variable, and B_i is the desired
regression coefficient. In general, X_0 is unity, and the
equation is that of a "hyperplane" in (p-1) dimensions.

Hydrologic variables are generally multiplicative in
nature (Wallis, 1965), and plotting one variable against
another on logarithmic paper usually approximates a linear
relationship. For this reason, hydrologists often assume
that the dependent variable will be related to the p
independent variables in the form

    Y = B_0 X_1^{B_1} X_2^{B_2} X_3^{B_3} \cdots X_p^{B_p}  (2)

Upon taking logarithms of this equation, a linear equation
having the form

    \log Y = \log B_0 + B_1 \log X_1 + B_2 \log X_2
             + \cdots + B_p \log X_p                        (3)

is obtained. If the logarithms of the variables are used
as variables, then a linear multiple regression may be
performed that will provide the coefficients for Equation
(3). Equation (3) is the model used herein to represent
the equations for peak discharge rate and total runoff
volume. The logarithm to the base 10 of the measured
variable is represented hereafter by X_i.
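
As a minimal sketch of the logarithmic model for a single
independent variable (p = 1), the following fits Y = B_0 X^B_1
by ordinary least squares on base-10 logarithms. The function
name fit_power_law and the data are hypothetical and serve only
to show that the transformation recovers the multiplicative
coefficients.

```python
import math


def fit_power_law(x, y):
    """Fit Y = B0 * X**B1 through log10(Y) = log10(B0) + B1*log10(X)."""
    lx = [math.log10(v) for v in x]
    ly = [math.log10(v) for v in y]
    n = len(lx)
    mx, my = sum(lx) / n, sum(ly) / n
    sxx = sum((a - mx) ** 2 for a in lx)
    sxy = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    b1 = sxy / sxx                 # slope on logarithmic paper
    b0 = 10.0 ** (my - b1 * mx)    # intercept transformed back
    return b0, b1


# Hypothetical values generated from Y = 2 * X**0.5.
area = [1.0, 10.0, 100.0, 1000.0]
peak = [2.0 * a ** 0.5 for a in area]
b0, b1 = fit_power_law(area, peak)
print(b0, b1)
```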

Principal  Component  Analysis  Theory 

The attempt in a principal component analysis is to
find a set of linear equations which reproduce the
information present in a set of measurements of several
independent variables. Hopefully, the number of linear
equations required will be smaller than the number of
variables. These new variables are referred to as
"principal components," or simply as "components" in the
literature. The linear form of the components, for the
i-th value of the j-th component, is

    V_{ji} = \sum_{k=1}^{p} l_{kj} x_{ki}                   (4)

where x_{ki} is the i-th measurement of the standardized
form of the variable X_k, or

    x_{ki} = \frac{X_{ki} - \bar{X}_k}{\sigma_k}            (5)

and l_{kj} is an undetermined coefficient for the k-th
variable and the j-th component. The other terms, \bar{X}_k
and \sigma_k, are the mean and standard deviation,
respectively, of the k-th variable. The purpose of
standardization is to give the standardized variables a
mean of zero and a variance equal to unity. For n
observations of the standardized variable x_k, the mean is

    \bar{x}_k = \frac{1}{n} \sum_{i=1}^{n} x_{ki} = 0;
                k = 1, 2, \ldots, p                         (6)

and the variance is

    \frac{1}{n} \sum_{i=1}^{n} x_{ki}^2 = 1;
                k = 1, 2, \ldots, p                         (7)


Standardization does not change the importance of the
variables, and the x_{ki} terms are now dimensionless
because the standard deviation has the dimensions of the
variable. Therefore, if the variables are standardized
before the analyses are begun, the effects of the different
dimensions of each variable are not present, and better
interpretations are possible.
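
The standardization described above can be checked numerically
with a short sketch. The variable names and values are
hypothetical; the standard deviation is computed with a divisor
of n so that the standardized variance equals unity.

```python
import math


def standardize(values):
    """Return (X - mean) / std, with the standard deviation using divisor n."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / std for v in values]


# Hypothetical measurements in two very different units.
area_sq_mi = [2.0, 4.0, 6.0, 8.0]
slope_pct = [0.5, 1.5, 1.0, 3.0]

for raw in (area_sq_mi, slope_pct):
    x = standardize(raw)
    mean = sum(x) / len(x)
    variance = sum(v * v for v in x) / len(x)
    print(round(mean, 12), round(variance, 12))
```

Both variables end with a mean of zero and a variance of one,
regardless of their original units.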

To solve for the l_{kj} values in Equation (4), two
conditions must be satisfied. First, it is desired that
the components be statistically uncorrelated (truly
independent) so that a regression of the components may be
performed without violating the independence assumption.
This means that the "correlation coefficient" for any two
components must equal zero. The correlation coefficient is
a statistical measure of the degree of linear association
between two variables. The value of the coefficient ranges
between plus one and minus one, depending on the increase
or decrease of one variable, respectively, as the other
variable is increased. A correlation of plus or minus one
would mean that the variables were perfectly related, and
a line plotted on a two-dimensional graph would intersect
every point. For any two components to be uncorrelated,

    r_{bc} = 0                                              (8)

must be satisfied, where r is the correlation coefficient,
and b and c are the subscripts of the respective
components.

The second condition for solution for the l_{kj} values
in Equation (4) is that the first component must reproduce
a maximum amount of the information in the original
measurements of the variables. A statistical measure of the
information contained by a component is the variance of the
component. Because the variables are standardized, they
each have a variance of unity, and the total variance is
equal to p, because there are p independent variables. If
the first few components are to reproduce the information
in the measurements of the variables, then the sum of the
variances of the components must also equal p. Therefore,
the component which has a maximum variance is desired. To
determine the component, it is necessary to write the
equation for the variance of any component in terms of the
unknown l_{kj} values, and then equate the partial
differentials of this equation, with respect to l_{kj}, to
zero. This procedure gives the l_{kj} values which maximize
or minimize the variance of the first component.

Kendall (1957, p. 14) takes the partial differentials
of the equation for the variance of a component and finds
the solution which minimizes or maximizes the variance.
If r_{jk} is the correlation of any two variables defined by

    r_{jk} = \frac{1}{n} \sum_{i=1}^{n} x_{ji} x_{ki}       (9)

and if L is an "undetermined multiplier" (see Appendix A
for a graphical derivation), then the criterion for the
extremum is the determinant (Kendall, p. 15)

    \begin{vmatrix}
    (1-L)  & r_{12} & r_{13} & \cdots & r_{1p} \\
    r_{21} & (1-L)  & r_{23} & \cdots & r_{2p} \\
    \vdots & \vdots & \vdots &        & \vdots \\
    r_{p1} & r_{p2} & r_{p3} & \cdots & (1-L)
    \end{vmatrix} = 0                                       (10)

If I is defined as an identity matrix, then the determinant
can be written

    | R - L I | = 0                                         (11)

in matrix notation. The matrix of correlation coefficients
of the standardized independent variables, R, is easily
computed by applying Equation (9) to all combinations of
variables, and L is the only unknown.
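
For two variables the determinant can be expanded by hand,
giving (1 - L)^2 - r_{12}^2 = 0, so that L = 1 + r_{12} or
L = 1 - r_{12}. The sketch below applies Equation (9) to two
hypothetical variables and evaluates these two roots. The
function names and the data are illustrative assumptions only.

```python
import math


def standardize(values):
    """Return (X - mean) / std, with the standard deviation using divisor n."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / std for v in values]


def correlation(xj, xk):
    """Equation (9): mean product of two standardized variables."""
    return sum(a * b for a, b in zip(xj, xk)) / len(xj)


def correlation_matrix(columns):
    z = [standardize(c) for c in columns]
    return [[correlation(a, b) for b in z] for a in z]


def eigenvalues_2x2(R):
    """Roots of |R - L*I| = 0 when p = 2, largest first."""
    r = abs(R[0][1])
    return 1.0 + r, 1.0 - r


# Hypothetical storm measurements.
storm_intensity = [0.2, 0.5, 0.9, 1.4, 2.0]
storm_duration = [1.0, 1.8, 2.1, 3.5, 3.9]
R = correlation_matrix([storm_intensity, storm_duration])
L1, L2 = eigenvalues_2x2(R)
print(R[0][1], L1, L2)
```

The two roots always sum to 2, the number of variables, which
anticipates the property of the eigenvalues used later in the
development.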

Equation (11) is referred to as the "characteristic"
equation of the correlation matrix. Several solutions to
this equation are available (Cooley and Lohnes, 1962;
Ralston, 1960) which yield the L's and a set of l_{kj}
values for each L. Also, there are generally p values of L
which satisfy Equation (11). These values are called
"eigenvalues," "latent roots," "characteristic roots,"
"proper values," or "characteristic values" by various
authors. The term "eigenvalues" is used herein.

The set of l_{kj} values corresponding to any L is known
as the "eigenvector" for that eigenvalue. If each l_{kj}
value in an eigenvector is divided by the square root of
the sum of squared l_{kj} values, then a "normalized
eigenvector" results. Some authors (Kendall, 1957) call the
normalized eigenvectors "principal components," and this
notation is used herein. Other authors (Eiselstein, 1967;
Harman, 1967) multiply the normalized eigenvector "loadings"
by the square root of the respective eigenvalue and call
this new vector the principal component, or "principal
factor." In the present analysis, a principal component
is identical to a normalized eigenvector, and a factor,
whenever cited, is meant to be defined as

    A_j = \sqrt{L_j} \, V_j                                 (12)

where the normalized eigenvector, V_j, is defined by
Equation (4), and L_j is the eigenvalue corresponding to
the j-th eigenvector. The l_{kj} values in Equation (4) are
coefficients of normalized eigenvectors.

Kendall (1957, p. 15) shows that the variance of a
component is

    Variance (V_j) = \frac{1}{n} \sum_{i=1}^{n}
                     \left( \sum_{k=1}^{p} l_{kj} x_{ki} \right)^2   (13)

which is also the component's eigenvalue. This means that
the component having the largest eigenvalue reproduces a
maximum amount of the information contained in the
independent variable correlations. The component having the
second largest eigenvalue reproduces the next largest
amount of the information, etc., for all p components.
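
The one-at-a-time extraction can be sketched with power
iteration followed by deflation. This is a substituted
technique chosen for brevity, not one of the solution methods
of the cited authors, and it assumes distinct, positive
eigenvalues. The function names and the 3 x 3 correlation
matrix are hypothetical.

```python
import math


def mat_vec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def principal_components(R, iterations=500):
    """Eigenvalues and normalized eigenvectors of a symmetric correlation
    matrix, extracted one at a time with the largest eigenvalue first."""
    p = len(R)
    M = [row[:] for row in R]
    eigenvalues, eigenvectors = [], []
    for _ in range(p):
        v = [1.0 / (k + 1) for k in range(p)]  # arbitrary start vector
        for _step in range(iterations):
            w = mat_vec(M, v)
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]          # keep unit length
        L = sum(a * b for a, b in zip(v, mat_vec(M, v)))  # variance of component
        eigenvalues.append(L)
        eigenvectors.append(v)
        # Deflation: remove the information reproduced by this component.
        M = [[M[i][j] - L * v[i] * v[j] for j in range(p)] for i in range(p)]
    return eigenvalues, eigenvectors


def factor_loadings(eigenvalues, eigenvectors):
    """Equation (12): A_j = sqrt(L_j) * V_j."""
    return [[math.sqrt(L) * x for x in v]
            for L, v in zip(eigenvalues, eigenvectors)]


# Hypothetical correlation matrix for three variables.
R = [[1.0, 0.8, 0.2],
     [0.8, 1.0, 0.3],
     [0.2, 0.3, 1.0]]
L, V = principal_components(R)
A = factor_loadings(L, V)
print([round(x, 4) for x in L])
```

The signs of the eigenvectors are arbitrary; only their
directions are determined by the characteristic equation.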

The sum of the eigenvalues for a "symmetric" matrix,
such as the correlation matrix used herein, equals the sum
of the principal diagonal entries (Hotelling, 1933, p. 429).
Because a principal component analysis uses the
self-correlations of the variables in this diagonal, this
sum is equal to p, because p independent variables are used.
Also, as shown earlier, the total variance in the
standardized independent variables is equal to p, and the
eigenvalues are the variances of each component. The total
variance of the components is equal to the total variance
present, and the components therefore contain all the
information in the correlation matrix, which in turn
contained the information in the measurements of the data.
Also, the variance of each component is the portion of the
total information reproduced by the component. The per
cent of the total variance "accounted" for by any
component is therefore given by

    \% Variance = \frac{L_j}{p} (100)                       (14)

Earlier, it was stated that as few components as
possible were desired to account for the total variance
in the independent variables. Because each successive
component accounts for a maximum amount of the remaining
variance in the original variables, some of the later
components can be expected to have small eigenvalues.
Originally, there are p components accounting for 100 per
cent of the variance. If, for example, 30 independent
variables were under investigation, then 30 components
would be computed. However, if 10 of the components
accounted for 90 per cent of the variance, then 20
components could be dropped if the 10 per cent loss in
information was justified by the reduction in variables for
regression. The regression equation would contain the 30
original variables, but would be performed on the 10
uncorrelated components and the dependent variable, and not
on the 30 independent variables. The resulting equation
should be a better prediction equation than one obtained
from ordinary multiple linear regression with all 30
variables, because the regression assumption of
independence would not be violated.

No analytical method of determining which components
are important could be found in the literature. Only
subjective observations of the values have been used.
Kendall (1957) uses only those components whose eigenvalues
are significantly larger than others. In one example, he
performed a principal component analysis on five variables
and arrived at five eigenvalues: 3.2470, 1.2753, 0.3859,
0.0700, and 0.0213, totaling 5.0000. Because of the large
difference between the third and fourth, he used only the
first three components in his regression equation for beer
consumption, accounting for 98 per cent of the original
variance. Cooley and Lohnes (1962, p. 160) state that if
unities were used in the principal diagonal of the
correlation matrix, then only those components whose
eigenvalues are greater than unity should be used in future
analyses. Wallis (1965) recommends the use of as many new
variates as are needed to account for as high as 99.5 per
cent of the variance, thereby retaining most of the
original variance. Other authors are aware of this problem,
but are not specific in their methods of solution. Anderson
(1967) agrees with Kendall's method of observing a "large
change" in the eigenvalues. Hotelling (1933, p. 421)
suggests that the components "whose contributions to the
total variance are small" be neglected. Eiselstein (1967)
neglected those components (dimensions) whose contributions
to the total variance were less than one per cent. Because
of the relatively small use of principal component analyses
to date, no mathematical method for determining which
variates to use has yet been established. According to
the literature, the investigator must analyze his own
results for the solution to this problem.
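
The retention rules quoted above can give different answers for
the same eigenvalues, as the following sketch suggests. The
eigenvalues are hypothetical (loosely patterned on a
five-variable problem), and the function names are illustrative
only; the rules themselves are simplified readings of the cited
recommendations.

```python
def percent_variance(eigenvalues):
    """Per cent of total variance for each component: 100 * L_j / p."""
    p = len(eigenvalues)
    return [100.0 * L / p for L in eigenvalues]


def keep_by_unit_eigenvalue(eigenvalues):
    """Retain components whose eigenvalues exceed unity."""
    return sum(1 for L in eigenvalues if L > 1.0)


def keep_by_cumulative(eigenvalues, target_percent):
    """Retain as many components as needed to reach a cumulative per cent."""
    total = 0.0
    for count, pct in enumerate(percent_variance(eigenvalues), start=1):
        total += pct
        if total >= target_percent:
            return count
    return len(eigenvalues)


def keep_by_minimum_share(eigenvalues, min_percent=1.0):
    """Neglect components contributing less than a minimum per cent."""
    return sum(1 for pct in percent_variance(eigenvalues)
               if pct >= min_percent)


# Hypothetical eigenvalues for p = 5 variables (they sum to 5).
eigenvalues = [3.2, 1.3, 0.4, 0.07, 0.03]
print(keep_by_unit_eigenvalue(eigenvalues))    # eigenvalue greater than unity
print(keep_by_cumulative(eigenvalues, 95.0))   # cumulative per cent target
print(keep_by_cumulative(eigenvalues, 99.5))   # stricter cumulative target
print(keep_by_minimum_share(eigenvalues))      # at least one per cent each
```

For these values the rules retain between two and five
components, which illustrates why the choice is left to the
investigator's judgment.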

In the above paragraphs, it has only been stated that
the components derived are uncorrelated, or that Equation
(8) applies. Kendall (1957, p. 14) proves that the
correlation of any two components is zero, and that all the
components are perpendicular, forming a "p-dimensional
reference frame." For this reason, principal components
are sometimes referred to as "principal axes."

The principal components provide a means of deriving
uncorrelated variates for multiple regression. Graphically,
they are reference axes drawn through the measurements of
the independent variables so that a maximum amount of the
information contained by the measurements is explained by
the components. However, no information about the relative
importance of the original variables is presently
available, and this is the purpose of the next section,
which presents varimax rotation theory.

Varimax Rotation Theory

Varimax rotation of normalized factors from a factor
analysis was originated by Kaiser (1958). Kaiser (1959)
developed a computer program to perform the rotation.
A graphical explanation of factor rotation was not
presented by Kaiser, and is therefore presented in
Appendix B.

The set of axes through the data points which gives
the best interpretation of the contribution of each
independent variable to the information is obtained with
varimax rotation by rotating the principal axes discussed
previously. If the b_{kj} values are the desired
coefficients for the rotated principal factors, and if
the a_{kj} values are the coefficients from Equation (12),
then the variance of the j-th rotated factor is given by
Kaiser (1959) as


(15) 


which he calls the "varimax criterion" for maximum
interpretation (see Appendix B for a graphical
interpretation).

Because the $a_{kj}$ values are known, and because the $b_{kj}$
values are geometric functions of the $a_{kj}$ values and the
angle of rotation of the principal axes, $\theta$, which maximizes
the criterion, the angle is the only unknown, and

$$b_{k1} = a_{k1} \cos\theta + a_{k2} \sin\theta \qquad (16a)$$

$$b_{k2} = -a_{k1} \sin\theta + a_{k2} \cos\theta \qquad (16b)$$

for the first and second factors (Kaiser, 1959). The
desired angle in the plane of these factors can be obtained
by substituting the $b_{kj}$ values from Equations (16) into
Equation (15), differentiating with respect to $\theta$, and
solving for the angle which makes the differential equal
to zero. After the angle is found, the values of $b_{k1}$ and
$b_{k2}$ are computed from Equations (16), giving the loadings
of the rotated factors. If these single-plane rotations
are made for all combinations of two factors, then the
complete set of $b_{kj}$ values can be found. Substitution
of these values into Equation (15) will give a value of
the criterion which should be a maximum, and the $b_{kj}$ values
are the coefficients relating the rotated factors to the
independent variables. The conditions for a maximum of
Equation (15) are found by Kaiser, who takes the second
derivative of Equation (15) with respect to $\theta$ and solves
for the limits on $\theta$ which make the derivative negative.
If $\theta$ for any single-plane rotation is not within these
limits, then the angle is equated to the nearest limit,
and a second rotation is made from this point. This process
will converge to the solution for Equation (15) which
is a maximum (Kaiser, 1959), giving the desired $b_{kj}$ values.
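
The single-plane rotation procedure described above can be sketched
in a few lines. This is a minimal illustration and not the program of
Appendix G: it uses the raw (unnormalized) criterion, and it selects
the maximizing angle in closed form with a four-quadrant arctangent
instead of the second-derivative limits described in the text. The
function names and the convergence tolerance are assumptions made for
the example.

```python
import numpy as np


def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings."""
    squared = np.asarray(loadings, dtype=float) ** 2
    return float(np.sum(np.mean(squared ** 2, axis=0) - np.mean(squared, axis=0) ** 2))


def varimax_pairwise(loadings, max_sweeps=50, tol=1e-10):
    """Rotate factor pairs in turn until every rotation angle is negligible."""
    b = np.array(loadings, dtype=float)
    n_vars, n_factors = b.shape
    for _ in range(max_sweeps):
        largest_angle = 0.0
        for j in range(n_factors - 1):
            for m in range(j + 1, n_factors):
                x, y = b[:, j].copy(), b[:, m].copy()
                u = x ** 2 - y ** 2
                v = 2.0 * x * y
                num = 2.0 * np.sum(u * v) - 2.0 * np.sum(u) * np.sum(v) / n_vars
                den = np.sum(u ** 2 - v ** 2) - (np.sum(u) ** 2 - np.sum(v) ** 2) / n_vars
                # arctan2 places 4*theta in the quadrant of the maximum.
                theta = 0.25 * np.arctan2(num, den)
                c, s = np.cos(theta), np.sin(theta)
                b[:, j] = x * c + y * s      # planar rotation, first factor
                b[:, m] = -x * s + y * c     # planar rotation, second factor
                largest_angle = max(largest_angle, abs(theta))
        if largest_angle < tol:
            break
    return b


# Hypothetical loadings: a perfect simple structure turned by 30 degrees.
simple = 0.8 * np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
angle = np.deg2rad(30.0)
turn = np.array([[np.cos(angle), -np.sin(angle)], [np.sin(angle), np.cos(angle)]])
unrotated = simple @ turn
rotated = varimax_pairwise(unrotated)
```

Because each step is an orthogonal rotation in one plane, the sum of
squared loadings of every variable is unchanged, while the criterion
cannot decrease from one rotation to the next.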

Because the perpendicularity of the factors is maintained
by Kaiser's normal varimax rotation, the final
factors are uncorrelated and provide independent variates
for regression. The principal components are also
perpendicular, but give no information about the variables because
the coefficients are not derived for this purpose. If
certain variables are not highly correlated with any of
the rotated factors, then they do not contribute to the
information contained by the factors, and they may be
deemed unimportant to the information. Also, each rotated
factor accounts for a percentage of the total information,
and if some factors are not important, then any variables
correlated only with these factors are also unimportant.
Certain independent variables may therefore be regarded
as unimportant and excluded from the regression analysis
of the rotated factors.

Multiple Regression of Principal Components and Rotated Factors

A principal component analysis could be performed
with no rotation for interpretation of the variables, if
the independent variables used were all known to be
important. A multiple regression of the principal
components would give an equation for the dependent variable
in terms of all the independent variables with no
regression assumption violations. If this were the only purpose
of the analysis, then one of two methods could be used
to obtain the regression coefficients of the independent
variables.

One method of obtaining the coefficients for the
principal components is to solve the equations of the
components using all the measurements of the original
independent variables, giving n sets of values for the
s components, one set for each measurement of the dependent
variable. The log-transformed data from one measurement
of all the variables is substituted into Equation (4)
for all s components, giving one set of the new independent
variates for the respective measurement of the dependent
variable. This is repeated for all observations,
and an ordinary linear multiple regression of the dependent
variable and the values for the components is performed.
This gives the regression constant and coefficients
for the equation of the i-th observation

$$y_i = F_0 + \sum_{j=1}^{s} F_j V_{ji} \qquad (17)$$

where the $F_j$ values are the regression coefficients of the
components, and y is the standardized dependent variable.
Substituting Equation (4) for $V_{ji}$ gives the regression
equation in terms of the original standardized independent
variables

$$y_i = F_0 + \sum_{j=1}^{s} F_j \sum_{k=1}^{p} v_{kj} x_{ki} \qquad (18)$$

If y and $x_k$ are the standardized form of the logarithms
of the original variables, then

$$x_{ki} = \frac{\mathrm{Log}\, X_{ki} - \overline{\mathrm{Log}\, X_k}}{s_{\mathrm{Log}\, X_k}} \qquad (19)$$

where $X_{ki}$ is the i-th measurement of the variable $X_k$.
Substituting this and a similar equation for y into
Equation (18) gives the regression equation in terms of the
measured variables

$$\mathrm{Log}\, Y = \sum_{k=1}^{p} B_k \,\mathrm{Log}\, X_k + C \qquad (20)$$

Equation (20) is valid only if

$$B_k = s_{\mathrm{Log}\, Y} \sum_{j=1}^{s} \left( F_j \frac{v_{kj}}{s_{\mathrm{Log}\, X_k}} \right)$$

and if

$$C = s_{\mathrm{Log}\, Y} F_0 + \overline{\mathrm{Log}\, Y} - s_{\mathrm{Log}\, Y} \sum_{k=1}^{p} \sum_{j=1}^{s} \left( F_j \frac{v_{kj}}{s_{\mathrm{Log}\, X_k}} \overline{\mathrm{Log}\, X_k} \right)$$

Equation (20) reduces to the form desired, given by
Equation (2) as

$$Y = B_0 X_1^{B_1} X_2^{B_2} \cdots X_p^{B_p} \qquad (2)$$

if

$$B_0 = \mathrm{Antilog}(C)$$

Because the original data must be used twice for
the computation of the coefficients in Equation (2),
once for the correlation matrix computation and once
in the regression analysis, a better method of obtaining
the coefficients with fewer computations may be used.
Kendall (1957) shows that the regression coefficients for
the principal components (normalized eigenvectors) can
be computed from the eigenvalues by using the equation

$$F_j = \frac{\sum_{k=1}^{p} v_{kj} r_{ky}}{\lambda_j} \qquad (21)$$

which can be derived using multiple regression theory.
The values of $r_{ky}$ are the correlation coefficients of the
k-th standardized independent variable with the dependent
variable y, and are known from the correlation analysis.
The other terms are defined earlier, and the $F_j$ values
can be computed without returning to the original data.
Once the $F_j$ values are found, Equation (2) gives the
desired relationship.
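
The two methods can be compared numerically. The sketch below uses
synthetic stand-in data (the sample size, number of variables, and
generating coefficients are all hypothetical) and assumes that the
component values are formed as the standardized variables times the
normalized eigenvectors. It computes the component coefficients once
by regression on the component values and once from Equation (21),
and then back-transforms to the coefficients of Equation (20).

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 4                                    # hypothetical sizes
log_x = rng.normal(size=(n, p))                 # stand-in for Log X_k
log_x[:, 1] += 0.8 * log_x[:, 0]                # make two variables correlated
log_y = 1.5 + log_x @ np.array([0.6, -0.3, 0.2, 0.1]) + 0.1 * rng.normal(size=n)

# Standardize the logarithms, as in Equation (19).
mean_x, sd_x = log_x.mean(axis=0), log_x.std(axis=0, ddof=1)
mean_y, sd_y = log_y.mean(), log_y.std(ddof=1)
x = (log_x - mean_x) / sd_x
y = (log_y - mean_y) / sd_y

# Correlations among the x's and of each x with y.
r_xx = np.corrcoef(x, rowvar=False)
r_xy = x.T @ y / (n - 1)

# Eigenvalues and normalized eigenvectors (columns), largest first.
eigenvalues, eigenvectors = np.linalg.eigh(r_xx)
order = np.argsort(eigenvalues)[::-1]
eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]

# Method 1: compute component values for every observation, then regress.
components = x @ eigenvectors
design = np.column_stack([np.ones(n), components])
coefficients, *_ = np.linalg.lstsq(design, y, rcond=None)
f0, f_regression = coefficients[0], coefficients[1:]

# Method 2: Equation (21), using only correlations and eigenvalues.
f_kendall = (eigenvectors.T @ r_xy) / eigenvalues

# Back-transformation to the coefficients B_k and C of Equation (20).
b = sd_y * (eigenvectors @ f_kendall) / sd_x
c = sd_y * f0 + mean_y - np.sum(b * mean_x)
```

With all p components retained, the two sets of component
coefficients agree, and `b` and `c` coincide with an ordinary
least-squares fit of Log Y on the Log X values.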

A simplified method for multiple regression of rotated
factors could not be found in the literature, but
a method exactly the same as the method indicated by
Equation (17) for principal component regression may be
employed. Because the rotated factors are given by the
equation (see Appendix B)

$$A'_{ji} = \sum_{k=1}^{p} c_{kj} x_{ki} \qquad (22)$$

where the prime indicates a rotated factor, and the $c_{kj}$
values are the factor loadings, then Equations (17) through
(20) may be applied to the rotated factors, giving

$$y_i = G_0 + \sum_{j=1}^{s} G_j A'_{ji} \qquad (23a)$$

$$B_k = \frac{s_{\mathrm{Log}\, Y}}{s_{\mathrm{Log}\, X_k}} \sum_{j=1}^{s} \left( G_j c_{kj} \right) \qquad (23b)$$

$$C = s_{\mathrm{Log}\, Y} G_0 + \overline{\mathrm{Log}\, Y} - s_{\mathrm{Log}\, Y} \sum_{k=1}^{p} \sum_{j=1}^{s} \left( G_j \frac{c_{kj}}{s_{\mathrm{Log}\, X_k}} \overline{\mathrm{Log}\, X_k} \right)$$

where the $G_j$ values are from an ordinary linear multiple
regression of the rotated factors and dependent variable,
using only those independent variables deemed important.



Unlike the component regression, an equation for the
dependent variable in terms of the important independent
variables is obtained with Equation (23a). This equation,
or the one obtained from a principal component regression,
should give reasonable predictions of the dependent
variable, and should exhibit sensible signs and magnitudes
for the coefficients of the variables.

Summary

The methods reviewed herein begin with the computations
of the correlations of all the standardized variables,
dependent and independent. This is followed by
the principal component analysis of the correlations among
the independent variables. The components are then
converted to factors which are rotated, allowing the
interpretation of the importance of the independent variables
to the information contained in the original measurements.
If the measurements of all the independent variables may
be obtained, then the principal component regression
equation provides the best prediction. Otherwise, an adequate
regression equation from the rotated factors and
important independent variables may be used.


Chapter  IV 
ANALYSIS OF DATA


The  procedures  developed  in  Chapter  III  were  applied 
to  the  watershed  data  in  the  order  discussed  in  that  chap- 
ter.    The  methods  used  and  results  obtained  in  recording 
the  data,   selecting  the  variables,  computing  the  values  of 
the  variables,  and  analyzing  the  values  are  presented  in 
this  chapter. 

DATA RECORDED

The measurements of 29 independent variables and two
dependent variables for 50 runoff events were obtained
from continuous data taken from five central and eastern
Montana watersheds. The watersheds were selected and
instrumented in 19^3 (Williams, 1965). Residents living
on or near the watersheds were consulted, and a system
for instrumentation of the watersheds was established by
those residents who agreed to service and maintain the
instruments. Table I gives the watershed names, locations,
sizes, and instrumentation numbers. The weather stations
consisted of instruments which measured and recorded, at
half-hour intervals, the following information: soil
moisture (per cent on a dry weight basis) and temperature
(deg F) at 3, 9, and 18 inches below ground surface; air
temperature (deg F) at 4 and 10 feet above the surface;
and wind speed (mph) and direction at 10 feet above the
surface. United States Geological Survey (USGS)
quadrangle maps and some field slope measurements provided
the measurements of the watershed topographic variables.
The U.S. Soil Conservation Service (SCS) performed surveys
and prepared maps showing the types and proportions of
the soils on the watersheds. Measurements of infiltration
rates for most of the soils were made by project
personnel during the summer of 1966. The USGS obtained
cross-section and velocity measurements near the water-stage
recorders for some of the runoff events, yielding
data for the construction of stage-discharge curves.
Aerial photographs of the watersheds provided information
about the land use and snow coverage.

Table I. Instruments Located on Watersheds

                        Lone Man  Bacon   Hump    Duck    E.F. Duck
                        Coulee    Creek   Creek   Creek   Creek
Snow Courses               3        3       3       3        3
Weather Stations           2        2       1       3        3
Wind Stations              1        1       1       1        1
Water-stage Recorders      1        1       1       2        1

[Rows for land type, area (sq mi), runoff events, and rain gages
are illegible.]

Some weather stations did not have instruments for measuring
wind speed and direction.

(The instrument stations for East Fork Duck Creek were the
same as for Duck Creek.)

Data from the continuous water-stage recorders were
reduced to hydrographs relating discharge rate to time,
using the stage-discharge relationships for each recorder.
These hydrographs provided the criterion for selection of
the runoff events to be used. Whenever the peak discharge
rate was larger than 10 cfs, a starting and an ending time
of discharge were established, giving the runoff hydrograph
for that event. Generally, there was no base flow before
the event, and the hydrograph was terminated when the
discharge rate reduced to zero flow. This procedure
established the 50 peak discharge rates, and the measurements
of the second dependent variable, total runoff, were
determined from the area under each hydrograph.
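
The event selection and the area under the hydrograph can be
illustrated as follows. This is only a sketch: the function name, the
use of the trapezoidal rule, and the units of the result (cubic feet,
from cfs and hours) are assumptions made for the example, since the
integration method is not stated in the text.

```python
def runoff_event(times_hr, discharge_cfs, threshold_cfs=10.0):
    """Return (peak in cfs, volume in cubic feet) for one hydrograph,
    or None when the peak does not exceed the selection threshold."""
    peak = max(discharge_cfs)
    if peak <= threshold_cfs:
        return None
    volume = 0.0
    for k in range(1, len(times_hr)):
        dt_seconds = (times_hr[k] - times_hr[k - 1]) * 3600.0
        # Trapezoidal area between two successive readings.
        volume += 0.5 * (discharge_cfs[k] + discharge_cfs[k - 1]) * dt_seconds
    return peak, volume


# Hypothetical triangular hydrograph: zero flow, 20 cfs peak, zero flow.
event = runoff_event([0.0, 1.0, 2.0], [0.0, 20.0, 0.0])
rejected = runoff_event([0.0, 1.0, 2.0], [0.0, 8.0, 0.0])
```

For the triangular example the volume is 20 cfs-hours, or 72,000
cubic feet, and the second hydrograph is rejected because its peak is
below 10 cfs.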

SELECTION OF INDEPENDENT VARIABLES

Measurements of 29 independent variables for each
runoff event were computed. The variables are presented
in Table II, and this is followed by the reasons for the
selection of these particular variables. Detailed
descriptions of the variables are listed in Appendix C.

Table II. Independent Variables Studied

Var.  Var.   Watershed and Storm Variable Definitions
No.   Name   & Units of Measurement
 1    A      Area (sq mi)
 2    SHP    Shape (dimensionless)
 3    AZ     Azimuth (deg)
 4    ELEV   Elevation (ft)
 5    GNDS   Ground slope (ft/ft)
 6    GNDL   Overland distance of flow (mi)
 7    FREQ   Stream frequency (1/sq mi)
 8    L      Main channel meander length (mi)
 9    S      Main channel straight-line slope (ft/ft)
10    USE    Land use ratio (dimensionless)
11    INFR   Soil infiltration rate (in./hr)
12    POND   Per cent ponds and reservoirs (%)
13    I      Precipitation intensity (in./hr)
14    ISD    Standard deviation of intensities (in./hr)
15    D      Duration of storm (hr)
16    TDP    Time distribution of precipitation (hr)
17    TPCP   Total precipitation (in.)
18    API    14-day antecedent precipitation index (in.)
19    SOLM   Soil moisture (%)
20    WDIR   Wind direction (dimensionless)
21    WEEK   Week of the year (dimensionless)
22    AIRT   Mean air temperature (deg F)
23    ATSD   Standard deviation of air temperatures (deg F)
24    WVEL   Mean wind velocity (mph)
25    WSD    Standard deviation of wind velocities (mph)
26    SOLT   Mean soil temperature (deg F)
27    STSD   Standard deviation of soil temperatures (deg F)
28    DEGD   Degree-days (deg F)
29    SWEQ   Snow-water equivalent in snowpack (in.)

A few comments concerning the detailed descriptions
of the variables should be made before proceeding
with the discussion of the analysis. The Thiessen areas
for the various types of "weighting" were determined from
repeated planimetering of reduced SCS soil maps of the
watersheds. The area attributed to each weather station,
snow course, or precipitation station was determined by
drawing perpendicular bisectors of the lines connecting
the stations. The bisectors were intersected with each
other and with the watershed boundaries, forming the
polygon for each station.
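
The Thiessen weighting can also be approximated numerically, which
may help to clarify what the polygons represent. The sketch below is
not the planimeter procedure used in this study: it assigns each
point of a sampling grid to its nearest station and takes the
fraction of points as the weight. The square "watershed", the station
coordinates, and the function name are all hypothetical.

```python
import numpy as np


def thiessen_weights(stations, points):
    """Fraction of the sample points that lie closest to each station."""
    stations = np.asarray(stations, dtype=float)
    points = np.asarray(points, dtype=float)
    # Squared distance from every point to every station.
    d2 = ((points[:, None, :] - stations[None, :, :]) ** 2).sum(axis=2)
    nearest = d2.argmin(axis=1)
    counts = np.bincount(nearest, minlength=len(stations))
    return counts / counts.sum()


# Hypothetical unit-square watershed sampled at 100 x 100 cell centers.
centers = (np.arange(100) + 0.5) / 100.0
grid = np.array([(x, y) for x in centers for y in centers])
weights = thiessen_weights([(0.25, 0.5), (0.75, 0.5)], grid)

# Weighted watershed value from two hypothetical station readings.
weighted_value = float(np.dot(weights, [1.0, 2.0]))
```

When a station is not recording, the weights can be recomputed from
the remaining stations only, which mirrors the practice described
later for non-operative stations.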

Because contoured topographic maps with small contour
intervals were not available for the Duck Creek and East
Fork of Duck Creek watersheds, nor for the Bacon Creek
watershed, transit-stadia surveys of the main channels
of Duck Creek and East Fork of Duck Creek were performed
to establish the vertical properties of these watershed
channels. The slopes obtained from the survey varied by
less than 10 per cent from the slopes obtained from the
contoured USGS maps. A careful analysis of these maps
will therefore yield satisfactory values for the slope
variables, GNDS and S. For this reason, a survey of the
Bacon Creek watershed was not conducted, and a similar
large-interval map was used.

The variables measuring ground slope and overland
distance of flow were measured in the manner chosen
because of the difficulty of other methods of representing
these parameters. Some investigators use the method
employed here, while others measure a random number of
distances and slopes from ridges to channels, and average
these. Because the steepest portions of watersheds are
generally in the upper reaches, the method employed
attempts to establish the steepest and shortest path for
the overland flow.

Several methods of measuring the population of stream
segments on a watershed can be found in the literature
(Chow, 1964). The method chosen represents the number
of segments per unit area of the watershed, and
therefore has a physical meaning. A large value for this
variable would indicate that the watershed exhibits a large
number of branching streams and tributaries.

Because aerial photographs were not available for
each year of data recorded, the land-use variable remained
constant for each runoff event. Other forms for this
variable have been suggested, but the available data were
not sufficient to warrant the use of any of them. The
index chosen increases with the vegetation on the
watershed, and therefore has a physical interpretation.

The infiltration variable was chosen to measure an
average infiltration rate for the different soil types.
A single number for each watershed is not altogether
reasonable if many soil types are present, but the
agreement of the average infiltration rates on each of the
three SCS soil types with published data was encouraging.
Table III compares the measured values with the suggested
ranges from the literature.

Table III. Measured and Published SCS Infiltration Rates

SCS Soil Type    Measured (in./hr)    Published* (in./hr)
B                0.129                0.15 to 0.30
C                0.094                0.05 to 0.15
D                0.040                0.00 to 0.05

*From Chow (1964), p. 12-26.

The pondage variable was not easily obtained, and
may not be indicative of the storage which takes place
during a runoff event. The purpose of using this
variable was to indicate the possible storage capacity of
the watersheds. Although the depths of the reservoirs
were not included in the variable, the surface area is
frequently found to be related to depth, and both may
be measured in the same variable.

The remaining variables were chosen because of their
frequent use in similar studies. The determination of values
of all the variables was time-consuming because computations
of some of the variables required analyses of large amounts
of data. The development of good prediction equations
usually requires easily-obtained variables, and this fact
was also considered when choosing the variables. Table IV
summarizes the physiographic variables, which were assumed
to remain constant for each watershed. The values of the
storm variables are not tabulated herein but are available
in the Department of Civil Engineering and Engineering
Mechanics, Montana State University. The means and standard
deviations of the storm and physiographic variables are
listed in Appendix H for the raw and log-transformed
variables.

Table IV. Summary of Watershed Characteristics

Variable          Lone Man   Bacon      Hump       Duck       E.F. Duck
                  Coulee     Creek      Creek      Creek      Creek
A (sq mi)         14.1       17.97      7.61       53.79      13.67
AZ                258        289        177        164        138
GNDS (ft/ft)      0.0194     0.0162     0.0650     0.0596     0.0763
GNDL (mi)         0.878      0.516      0.239      0.820      0.380
L (mi)            8.446      12.895     6.444      23.614     13.051
S (ft/ft)         0.00639    0.01182    0.02412    0.00596    0.00692
USE (%)           1.35       75.18      45.64      9.67       10.51
INFR (in/hr)      0.09481    0.08824    0.10155    0.10022    0.09689
POND (%)          0.02352    0.00915    0.00000    0.00614    0.02887

SHP (mi/mi)†      4.332, 2.569, 2.721, 3.684
ELEV (ft)†        3916, 4578, 2853, 2862
FREQ (1/sq mi)†   26.66, 24.18, 14.05, 15.80

† Only four of the five entries are legible; they are listed in
source order without column assignment.

Operational and mechanical problems with the
instruments during the times of interest caused some missing
values for the storm variables. At least one recording
precipitation station was operational during each of the
storms, and hence no information was lost for the
precipitation variables. Thiessen areas for only those stations
which were recording were used whenever some stations
were non-operative. The greatest losses of complete data
occurred from the weather stations. Whenever data were
missing from all the weather stations on a watershed,
the average of the variable computed for the same time
period over the other years of data was used. The one
exception to this was the variable for wind direction.
Here, the time difference of appearance of precipitation
on the recording precipitation stations was used to
estimate the direction of movement of the storm, and
therefore the probable wind direction.

The University computers were used extensively in
the reduction of the data for the value of each storm
variable for all the storms. Most of the programs
written were elementary and are not included with this report.
One program for computation of variables 13, 14, 15,
and 17 in Table II is rather involved, and is included
in Appendix D. The standard Weather Bureau format for
recording and non-recording precipitation data was used
in punching the project data, but operations on this
format were found to be difficult. The included program
could probably have been much shorter if another input
format had been used.

ANALYSIS OF 31 VARIABLES
Correlations 

After the values of the 29 independent variables
and 2 dependent variables were computed for the 50 runoff
events, the data were transformed by taking the base-10
logarithms of all the values. This was done in order to
use Equation (3) as the model for the prediction equation.
Zero measurements of any of the variables were set equal
to 0.00001 before transformation, in order to compute
logarithms with the computer. The correlation coefficients
of each of the transformed variables with all other
variables were then computed using a modified version of a
correlation computer program from Cooley and Lohnes (Appendix
E). The means and standard deviations of all log-transformed
variables and of the original variables were also
computed, and these are listed in Appendix H. Computations
for the eigenvectors and their respective eigenvalues
required the correlations between the standardized,
transformed variables (Equations 9 and 10). However, since
these correlations are identical to the values obtained
for the log-transformed variables, as stated earlier,
standardization was not employed at this point of the
analysis.
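
The transformation step and the claim that standardization leaves the
correlations unchanged can be checked with a short sketch. The data
here are randomly generated stand-ins, and the function name is
hypothetical; only the substitute value of 0.00001 for zero
measurements is taken from the text.

```python
import numpy as np


def log_transform(values, zero_substitute=1e-5):
    """Base-10 logarithms, with zero measurements replaced beforehand."""
    values = np.asarray(values, dtype=float)
    return np.log10(np.where(values == 0.0, zero_substitute, values))


rng = np.random.default_rng(1)
raw = rng.uniform(0.1, 10.0, size=(50, 3))      # hypothetical measurements
raw[0, 2] = 0.0                                 # one zero measurement

logged = log_transform(raw)
standardized = (logged - logged.mean(axis=0)) / logged.std(axis=0, ddof=1)

r_logged = np.corrcoef(logged, rowvar=False)
r_standardized = np.corrcoef(standardized, rowvar=False)
```

The two correlation matrices agree because a correlation coefficient
is unaffected by shifting or rescaling either variable. The zero
substitute, on the other hand, maps to a logarithm of -5, which can be
far from the remaining values of a variable.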

Principal Component Analysis

The correlation coefficients for the 29 independent
variables were entered into a modified version of a
principal component computer program given by Cooley and
Lohnes (Appendix F). This program computed the eigenvalues
and normalized eigenvectors of the correlation matrix,
giving 29 principal components and their respective
variances. Table V lists the variance, percentage of total
variance, and accumulated percentage of the first 18
components. Equation (14) was used in computing the per
cent of the total variance accounted for by each component,
where p was the number of independent variables. The
other eleven components contributed about equally to the
small per cent remaining, and were considered unimportant.

Table V. Properties of the First 18 Components

Component              Per Cent of       Accumulated
Number      Variance   Total Variance    Per Cent
 1          9.625      33.19             33.19
 2          4.772      16.46             49.65
 3          3.778      13.03             62.68
 4          2.323       8.01             70.69
 5          1.730       5.97             76.66
 6          1.385       4.77             81.43
 7          1.025       3.53             84.96
 8          0.961       3.31             88.27
 9          0.806       2.78             91.05
10          0.684       2.36             93.41
11          0.430       1.48             94.89
12          0.410       1.41             96.30
13          0.278       0.96             97.26
14          0.214       0.74             98.00
15          0.180       0.62             98.62
16          0.140       0.48             99.10
17          0.114       0.39             99.49
18          0.072       0.25             99.74

The criteria of Eiselstein, Cooley and Lohnes, and
Kendall were combined in determining which components to
retain for further analyses. If only those components
whose eigenvalues are greater than unity (Cooley and
Lohnes) were used, then only about 85 per cent of the
information in the measurements would be retained by seven
components. There were no obvious "large" changes in the
values (Kendall), and twelve components would have to be
retained if those components having less than one per cent
of the variance were discarded. The author decided to
retain the first ten components, losing about 7 per cent of
the information.
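
The retention criteria can be checked against the variances of
Table V. The sketch below assumes that the per cent of total variance
is 100 times the eigenvalue divided by p, which is consistent with
the tabulated values; the variable names are hypothetical.

```python
# Variances (eigenvalues) of the first 18 components, from Table V.
eigenvalues = [9.625, 4.772, 3.778, 2.323, 1.730, 1.385, 1.025, 0.961, 0.806,
               0.684, 0.430, 0.410, 0.278, 0.214, 0.180, 0.140, 0.114, 0.072]
p = 29  # number of independent variables

percent = [100.0 * ev / p for ev in eigenvalues]

accumulated = []
running_total = 0.0
for value in percent:
    running_total += value
    accumulated.append(running_total)

# Components with eigenvalues greater than unity.
n_above_unity = sum(1 for ev in eigenvalues if ev > 1.0)

# Components contributing at least one per cent of the variance.
n_one_percent = sum(1 for value in percent if value >= 1.0)

# Information retained by the first ten components.
retained_by_ten = accumulated[9]
```

The counts are seven and twelve components, respectively, and the
first ten components retain about 93.4 per cent of the variance,
in agreement with the discussion above.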

Table VI. Reduced Loadings for the First 10 Components

Table VI shows that none of the 29 independent
variables had small correlations with all 10 of the
important components. The reduced values in the rows of the
table are the correlations of the variables with the
components. The numbered components correspond to the first
ten components of Table V, which gives the respective
variance of each. Because no information about the
independent variables could be obtained at this point, the
components were reduced to factors via Equation (12), in
preparation for rotation.
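
The reduction of components to factors can be illustrated as below.
Equation (12) is not reproduced in this chapter, so the sketch assumes
the usual scaling, in which each normalized eigenvector is multiplied
by the square root of its eigenvalue. Under that assumption the
resulting loadings equal the correlations of the variables with the
components. The data are synthetic stand-ins.

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 200, 4                                   # hypothetical sizes
z = rng.normal(size=(n, p))
z[:, 1] += z[:, 0]                              # induce correlations
z[:, 3] -= 0.5 * z[:, 2]
x = (z - z.mean(axis=0)) / z.std(axis=0, ddof=1)

r = np.corrcoef(x, rowvar=False)
eigenvalues, eigenvectors = np.linalg.eigh(r)
order = np.argsort(eigenvalues)[::-1]
eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]

# Assumed form of the reduction: a_kj = v_kj * sqrt(lambda_j).
loadings = eigenvectors * np.sqrt(eigenvalues)

# Direct correlations of each variable with each component, for comparison.
components = x @ eigenvectors
correlations = np.array([[np.corrcoef(x[:, k], components[:, j])[0, 1]
                          for j in range(p)] for k in range(p)])
```

The sum of squared loadings in each column equals the eigenvalue of
that component, and with all components retained the sum of squared
loadings in each row equals one.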

Varimax Rotation of Principal Factors

An  initial  varimax  rotation  of  the  ten  factors  was 
performed  using  a  modified  version  of  a  computer  program 
listed  by  Cooley  and  Lohnes  (Appendix  G).     The  program 
was  modified  to  read  either  components  and  eigenvalues, 
or  factors.     Computations  for  the  variance  of  each  rotated 
factor  were  not  initially  made  by  the  program,  and  state- 
ments for  the  computations  and  output  were  included.  The 
program was tested with data from both Eiselstein and
Harman, and the results agreed very well with those
obtained by both investigators.

The ten factors were rotated by the program, giving
the new factors shown in Table VII. The reduced variance,
per cent of original variance, and accumulated percentage
of each factor are included in the table, showing that the
ten rotated factors account for all the variance in the
ten components, as expected. The loadings of the factors
are the correlations of each variable with each factor,
and the largest correlation for each variable is underlined.
As shown, all the variables had a high correlation with
only one factor.

Table VII. Properties of the 10 Rotated Factors

From the loadings and the criteria in the literature,
the variables were examined for their importance to the
rotated factors. As with the choice of the important
components, there are no analytical means of determining
which variables are important, or which factors could be
omitted. The literature stresses the fact that the
important variables should be chosen with "one hand over their
names", so that the investigator does not force the data
to indicate a wrong interpretation. Investigators of
multivariate analyses tend to agree that no more than two
variables per factor should be retained, and that loadings
smaller than about 0.40 can usually be considered to be
indicative of an unimportant correlation (Wallis, 1965).
This value should only be considered as a guide and not
as the accepted criterion for unimportance.
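
The screening guideline can be expressed as a small routine. The
loadings and variable names below are invented for illustration and
are not taken from Table VII; only the 0.40 guide value comes from
the text.

```python
import numpy as np

names = ["V1", "V2", "V3", "V4"]                # hypothetical variable names
loadings = np.array([[0.91, 0.10],              # hypothetical rotated loadings
                     [0.88, -0.20],
                     [0.15, 0.86],
                     [0.30, 0.25]])
guide_value = 0.40

# Factor on which each variable has its largest absolute loading.
dominant_factor = np.abs(loadings).argmax(axis=1)

# Size of that largest loading.
largest_loading = np.abs(loadings).max(axis=1)

# Variables whose largest loading falls below the guide value.
weak_variables = [name for name, value in zip(names, largest_loading)
                  if value < guide_value]
```

In this example only the fourth variable is flagged, since its
largest loading is 0.30. As the text cautions, such a flag is a guide
and would still be weighed against the investigator's judgment.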

An investigation of Table VII yields several possible
interpretations, even though the simple structure is
relatively clear because of the single, large loading
on each of the variables. Seven of the variables have
comparatively small correlations with all the factors,
and five of the remaining variables are highly correlated
with factors whose variances are small. The variables
which are not highly correlated with any of the factors,
GNDL, D, TPCP, API, SOLM, and STSD, do not contribute
appreciably to the information contained by the
measurements of the original 29 independent variables. Because
the rotated factors "reproduce" 93.4 per cent of the
original information, omission of the seven variables listed
would give approximately the same information contained
by measurements of all the variables.

Also, because of the small variances of the last four
or five factors, these factors do not reproduce a large
portion of the original information. The last three are
determined by essentially one variable each, ATSD, TDP,
and WDIR, respectively. These three variables are highly
correlated with the last three factors, but the factors
are not highly important to the total variance, and the
variables were therefore deemed to be relatively
unimportant.

Two other variables, L and WSD, were important,
respectively, to the fifth and sixth factors. The sixth
factor was classified with the last three, and the variable
WSD was also deemed to be unimportant. The relatively
large change in variance between the fifth and sixth
factors, when compared to the other changes, seems to justify
the single classification of the last five factors, i.e.,
not nearly as important as the first five.

The variables L and A are the only two variables
important to the fifth factor. Because variables which
"coexist" within a factor are generally highly correlated,
the exclusion of one yields essentially the same
information obtained when both are measured. Because the
variable A was considerably more highly correlated with
this factor, and because the factor was the least important
of the first five, L was also deemed unimportant.

The interpretation given in the preceding paragraphs
excludes 12 independent variables. An investigation of
the sixth and seventh factors yields the fact that all
the variables correlated with these factors have been
excluded, and the variances of the factors are essentially
equal. Wallis (1965) warns against omitting all the
variables important to the important factors, and for this
reason, the "hand" was lifted and the variable TPCP in
the seventh factor was not omitted in further analyses.
Objectivity, and the frequency of appearance of this
variable in other investigations, prohibited its exclusion here.
This essentially means that the first six factors were
deemed important, if the sixth and seventh factors in
Table VII are reversed. This interpretation left only 18
independent variables for further analysis.

Because a regression equation involving 18 independent
variables is probably not practical, two attempts to further
reduce the number of variables were made. The first
involved the use of only those variables, again with the
exception of TPCP, which displayed correlations higher
than an arbitrary value of 0.85 with the important factors.
The second attempt took advantage of the facts that the
"coexistent" variables within a factor are usually highly
correlated, and that the factors themselves are uncorrelated.
One variable from each of the first six factors,
using the reversed notation on factors 6 and 7 in Table VII,
was retained to yield six relatively uncorrelated variables.
In the first factor, SOLT had the highest correlation;
however, for reasons discussed later, the author believed
that the variable having the second highest correlation,
I, would be a better variable to measure the information
contained by the first factor. The variables having the
highest correlations with the next five factors were also
included, yielding the six most nearly uncorrelated variables.
Table VIII shows the three interpretations of the rotated
factors. Other interpretations were possible, but the
author felt that these would be adequate for the regression
analysis to follow. The second and third are meant only
to reduce the number of independent variables further,
and the first is felt to be the most appropriate interpretation
of the rotated factors.
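The two reduction rules described above can be sketched as follows. The loading matrix and the variable names V1 through V5 are hypothetical and stand in for the rotated loadings of Table VII; the function names are assumptions made for this illustration.

```python
import numpy as np

def select_by_threshold(loadings, names, factors, threshold=0.85):
    """Variables whose absolute loading on any listed factor meets the threshold."""
    sub = np.abs(loadings[:, factors])
    keep = (sub >= threshold).any(axis=1)
    return [n for n, k in zip(names, keep) if k]

def select_one_per_factor(loadings, names, factors):
    """The variable with the largest absolute loading on each listed factor."""
    return [names[int(np.argmax(np.abs(loadings[:, f])))] for f in factors]

# Hypothetical rotated loadings: rows are variables, columns are factors.
names = ["V1", "V2", "V3", "V4", "V5"]
loadings = np.array([
    [0.92,  0.10,  0.05],
    [0.88,  0.15, -0.02],
    [0.20, -0.90,  0.11],
    [0.05,  0.60,  0.30],
    [0.10,  0.08,  0.86],
])
important_factors = [0, 1, 2]

second = select_by_threshold(loadings, names, important_factors)
third = select_one_per_factor(loadings, names, important_factors)
print(second)
print(third)
```

A purely mechanical rule of this kind would always pick the highest loading in each factor. A substitution such as the use of I in place of SOLT remains a judgment applied after the selection.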


Table VIII. Three Sets of Important Variables from Three
Interpretations of Table VII


First              Second             Third
Interpretation     Interpretation     Interpretation

If an interpretation of the most important successive
variables to the information were desired, Table IX would
be the best approach. As shown, the 29 independent variables
are all present. The criterion used in formulating
this table was the decreasing order of correlations of
the variables with the factors from Table VII, with the
exception of the first factor, where SOLT was placed third
in line instead of first.


Table IX. Successive Importance of the Variables
to the Factors


Factor    Decreasing Importance

1         I
2         AZ
          S
5         A, L
6
7         TPCP
8
9
10
          D


After the interpretations of the initial rotation,
the ten original, unrotated factors were rotated three
additional times, using only the loadings on the important
variables for each of the three interpretations. This was
done to refine the rotated loadings on the important variables.
In each of the three rotations, the coefficients
did not appreciably change, but the new rotated factors
were computed to remove any effects of the unimportant
variables in each case. All ten of the original factors
were rotated to maintain a relatively high percentage of
explained variance. If only the first six factors had
been rotated, less than 76 per cent of the information
would have been retained (see Table VII). The reduction
in the number of variables used in the rotation causes
a loss of information, but this is not nearly as much as
would have been lost if combined with a further reduction
in the number of factors. Table X outlines the retained
information for the three rotations. The percentages given
are for the original variance in 29 variables, and are
therefore small. However, the retained variance of the
important variables is large in each case.
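A rotation restricted to the rows of the important variables can be sketched as below. This is a commonly used SVD-based formulation of the varimax criterion and is not claimed to be the program used in this study. The loading matrix, the retained row indices, and the function name are all hypothetical.

```python
import numpy as np

def varimax(loadings, max_iter=200, tol=1e-10):
    """Orthogonal varimax rotation of a (variables x factors) loading matrix."""
    L = np.asarray(loadings, dtype=float)
    p, k = L.shape
    R = np.eye(k)                       # accumulated rotation matrix
    last = 0.0
    for _ in range(max_iter):
        Lr = L @ R
        grad = L.T @ (Lr**3 - Lr @ np.diag((Lr**2).sum(axis=0)) / p)
        u, s, vt = np.linalg.svd(grad)
        R = u @ vt                      # nearest orthogonal matrix to the gradient
        if s.sum() < last * (1.0 + tol):
            break
        last = s.sum()
    return L @ R, R

# Hypothetical unrotated loadings for 8 variables on 3 factors, scaled so that
# each row has a chosen communality (sum of squared loadings) below one.
rng = np.random.default_rng(0)
raw = rng.normal(size=(8, 3))
unit = raw / np.linalg.norm(raw, axis=1, keepdims=True)
communality = np.linspace(0.70, 0.95, 8)
full = unit * np.sqrt(communality)[:, None]

important = [0, 2, 3, 5, 6]             # hypothetical rows of the retained variables
sub = full[important, :]
rotated, R = varimax(sub)

retained = (rotated**2).sum()           # variance of the retained variables explained
print(round(100.0 * retained / len(important), 1), "per cent of their variance")
```

Because the rotation is orthogonal, the variance explained for the retained variables is the same before and after rotation. Only its distribution among the factors changes.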


Table X. Variance Retained by Rotations of the Original
Factors with Reduced Numbers of Variables


Interpretation   Variables   Total Variance   % of Original
                             of 10 Factors    Variance

1                18          17.14            88.9
2                14          13.18            87.9
3                6           5.57             86.6

Regression  Analyses 

Four  prediction  equations  having  the  form  of  Equation 
(3)  were  obtained.     The  first  used  the  modified  principal 
component  method,  and  the  other  three  used  computed  values 
of  the  rotated  factors  from  the  original  data. 

Equation (21) was used to find the regression coefficients
for the ten important principal components, using
the six-place values of the loadings and eigenvalues listed
in Tables V and VI. The computations of Equation (21)
and the procedure of Equations (17) through (20) were
done by a short computer program written for this purpose,
giving the coefficients of the measured independent variables.
These are given in Table XI at the end of this
chapter, along with the results of the other regression
analyses. Multiplicative equations for both dependent
variables may be obtained from this table and Equation (2),

$$Y = B_0 \, X_1^{B_1} X_2^{B_2} X_3^{B_3} \cdots X_p^{B_p} \qquad (2)$$

where $B_0$ is the antilogarithm of the constant of regression.
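The multiplicative form of Equation (2) and its equivalent linear form in logarithms can be evaluated as in the following sketch. The constant, the exponents, and the measurements are hypothetical and are not taken from Table XI. Base-10 logarithms are an assumption; any base gives the same prediction.

```python
import math

def predict_multiplicative(b0, exponents, measurements):
    """Evaluate Y = B0 * X1**B1 * X2**B2 * ... for one set of measurements."""
    y = b0
    for name, b in exponents.items():
        y *= measurements[name] ** b
    return y

def predict_via_logs(b0, exponents, measurements):
    """Same prediction from log Y = log B0 + sum(Bi * log Xi)."""
    log_y = math.log10(b0) + sum(
        b * math.log10(measurements[name]) for name, b in exponents.items()
    )
    return 10.0 ** log_y

# Hypothetical constant, exponents, and field measurements.
b0 = 2.0
exponents = {"X1": 0.5, "X2": -1.0}
measurements = {"X1": 16.0, "X2": 4.0}

y_direct = predict_multiplicative(b0, exponents, measurements)
y_logs = predict_via_logs(b0, exponents, measurements)
print(y_direct, y_logs)
```

A negative exponent places the variable in the denominator of the equation, which is how the prediction equations of this chapter are written.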

An ordinary linear multiple regression analysis was
performed for each of the two dependent variables and the
ten rotated factors for each of the three interpretations
of the initial rotation. A computer program, "One-
Pass Multiple Regression", written by the Computing Center
Staff for general use, was employed. Input to this program
for each analysis was 50 observations of each dependent
variable and 50 computed values of ten factors, because the
regression was to be performed on the factors and not on
the important variables. The 50 values of the 10 factors
for each regression were computed and punched by a program
written to comply with Equation (22). The $x_{ij}$ values in
this equation are observations of the standardized form of
the important, log-transformed variables. The standardized
values were computed from Equation (19) for all 50 observations,
and only the measurements of the 18, 14, or 6
important variables were used in the computations for the
values of the factors.

The results of the regressions were the coefficients
for the rotated factors, $G_0$ and $G_j$, which had to be reduced
to coefficients for the important, log-transformed independent
variables. The values of $G_0$ and $G_j$ were substituted
into Equations (23), and a computer program for these
computations yielded the values of $B_0$ and $B_i$ for Equation
(2), the desired multiplicative form of the prediction equation.
Table XI summarizes the results of all eight regression
analyses. The antilogarithms of the constants are given,
and any of the prediction equations may be obtained from
Equation (2), where the X's are the field measurements of
the variables, $B_0$ is the antilogarithm of the regression
constant, and the $B_i$ are the coefficients listed. For
example, the equation for the peak discharge rate from the
second factor regression is found from Table XI and
Equation (2) to be


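$$
\mathrm{QMAX} = B_0 \,
\frac{\mathrm{A}^{0.263100}\; \mathrm{AZ}^{0.628922}\; \mathrm{ELEV}^{0.318249}\; \mathrm{POND}^{0.091439}}
     {\mathrm{SHP}^{b_{SHP}}\; \mathrm{GNDS}^{b_{GNDS}}\; \mathrm{ISD}^{b_{ISD}}}
\times
\frac{\mathrm{I}^{0.151434}}
     {\mathrm{TPCP}^{0.068278}\; \mathrm{WEEK}^{0.660111}\; \mathrm{AIRT}^{0.382531}\; \mathrm{DEGD}^{b_{DEGD}}}
$$

where $B_0$ and the exponents $b_{SHP}$, $b_{GNDS}$, $b_{ISD}$, and $b_{DEGD}$ take the
values listed for these terms in Table XI.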

The above equation was obtained from the fifth column of
Table XI, and an equation for the runoff volume may be
obtained in a similar fashion from the sixth column.
The units for QMAX in the above equation and in the other
equations for QMAX are cubic feet per second, and the units
for RUNF are inches, provided that the units of the independent
variables are the same as the units used in Table II.
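The reduction of factor-regression coefficients to exponents of the measured variables can be checked numerically. The sketch below assumes that each factor value is a linear combination of the standardized, log-transformed variables with a weight matrix `w`; that matrix, the data, and all names are hypothetical, and the algebra shown is one consistent way to carry out such a reduction rather than a reproduction of Equations (22) and (23).

```python
import numpy as np

rng = np.random.default_rng(1)
n_obs, n_var, n_fac = 50, 6, 3

# Hypothetical measurements, runoff values, and factor weights.
x = rng.lognormal(mean=1.0, sigma=0.5, size=(n_obs, n_var))
y = rng.lognormal(mean=0.0, sigma=0.3, size=n_obs)
w = rng.normal(size=(n_var, n_fac))

# Standardize the log-transformed variables and compute the factor values.
logs = np.log10(x)
m = logs.mean(axis=0)
s = logs.std(axis=0, ddof=1)
z = (logs - m) / s
f = z @ w

# Ordinary least squares of log y on the factor values: constant g0, slopes gj.
design = np.column_stack([np.ones(n_obs), f])
g, *_ = np.linalg.lstsq(design, np.log10(y), rcond=None)
g0, gj = g[0], g[1:]

# Reduce to exponents of the measured variables and a multiplicative constant.
b = (w @ gj) / s                        # exponent of each measured variable
b0 = 10.0 ** (g0 - np.sum(b * m))       # antilogarithm of the reduced constant

pred_from_factors = 10.0 ** (g0 + f @ gj)
pred_multiplicative = b0 * np.prod(x ** b, axis=1)
print(np.allclose(pred_from_factors, pred_multiplicative))
```

Both forms give the same predictions, so the multiplicative equation can be applied directly to field measurements without recomputing the factor values.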


[Table XI. Results of the regression analyses: coefficients of the
independent variables and antilogarithms of the regression
constants for QMAX and RUNF.]

Chapter V
DISCUSSION OF RESULTS

The methods employed and the results obtained in the
previous chapter indicate that the multivariate procedures
provide a statistical means of determining the variables
which are more important to runoff than other variables.
A discussion of the variables and the prediction equations
involving the variables follows.

VARIABLES IMPORTANT TO RUNOFF

The variables which were selected for this study are
believed to have been the best possible with the available
data. Each possibility was carefully analyzed before the
final decisions were made. The possibility of neglecting
unknown variables which have considerable importance to the
runoff always exists in any hydrologic analysis, because the
processes and factors are not yet completely defined (Sharp
and Biswas, 1965). The variables were selected according
to criteria set forth by Chow (1964). They are grouped
into two categories, "climatic" and "physiographic." The
climatic factors are sub-classified as those measuring
four processes of hydrology: precipitation, interception,
evaporation, and transpiration. The physiographic factors
have two major sub-sets, basin and channel characteristics.

Practically all of the factors listed by Chow were
present in this study. The precipitation factors of intensity,
duration, time distribution, areal distribution,
direction of storm movement, antecedent precipitation,
and soil moisture were directly or indirectly measured
by the variables I, D, TDF, TPCP, WDIR, API, and SOLM,
respectively.

The interception factors were probably not measured
as well as the others, but the land-use variable, USE,
indirectly measured the vegetation. This variable, combined
with the week of the year, and possibly the wind
velocity, should give any model a "feeling" for the amount
of interception which might occur in a storm.

An accurate amount of evaporation is extremely hard
to measure, but if the variables which affect evaporation,
namely air temperature, soil temperature, wind velocity, azimuth
(North-sloping vs. South-sloping), areal extent, pondage,
and possibly soil moisture, are measured, then the model
has probably been given an estimate of the evaporation
which might occur.

Transpiration is relatively unimportant for unvegetated
watersheds. The variables USE, SOLM, WEEK, AIRT,
WVEL, and SOLT were included to measure vegetation, soil
moisture, season, air temperature, wind velocity, and soil
temperature, and all of these probably affect the amount
of transpiration which occurs during a runoff event. Also,
transpiration is generally negligible during periods of
rainfall, and it would be logical to assume that only small
losses of moisture by this process occurred during the relatively
short storm periods used herein. The variables DEGD
and SWEQ were included because snowmelt runoff, although a
function of many variables, is at least related to the antecedent
temperature and the volume of water in the snowpack
before an event. Snowmelt runoff events comprised about
one-third of the runoff events of this study, and a prediction
equation applicable to these was desired.

The physiographic factors of size, shape, slope, orientation,
ponds and reservoirs, channel slope, and channel
length were measured by the variables A, SHP, GNDS, AZ,
ELEV, FREQ, INFR, POND, S, and L, respectively. Some factors
suggested by Chow which were not included in this
analysis were the frequency of occurrence of precipitation,
atmospheric pressure, shape of evaporative surface, solar
radiation, humidity, ground water capacity, sediment transport,
and channel shape and roughness.

IMPORTANT VARIABLES FROM THE ANALYSES

The variables selected as the most important to each of
the factors in Table IX follow the interpretation of the
factor rotation (Table VII). The variables I, ISD, and
SOLT in factor 1 were not written according to the factor
loadings for two reasons. First, the weather stations were
not recording for 14 of the 50 storms, and the treatment
for missing data had to be applied to obtain the weather
station variables for these storms. The intensities and
standard deviation of intensities were measured for all
storms, and the author therefore felt justified in placing
the soil temperature variable in the third position of importance
to this factor. The high loading in this factor
for SOLT was therefore probably due to the missing data
treatment used for this variable. The average soil temperature
for the same time period in other years of measurement
may not have been the best choice for the value to assign
to missing storms. However, the air temperature was also
missing for these storms, and no other alternative was
available.

All of the other variable "placings" in Table IX appear
to be logical. The position of SOLM in factor 2 might
be questioned, not from a hydrologic point of view, but
from an error interpretation. The measurements of the
soil moisture at any of the three depths were probably
the least accurately obtained of all the variables. The
soil moisture readings were taken from variations in
resistance measured across electrodes embedded in gypsum
blocks, and the resistance readings were converted to water
content values by means of calibration charts. The readings
did not seem to fluctuate considerably from day to
day, and only occasionally did they change during a storm.
The calibration was not well established, and some of the
soil moisture values used may therefore be questionable.
In any event, soil moisture had a small correlation with
all the factors, and since no information on the accuracy
of the readings was available, the variable was discarded
from further analyses. Intuitively, the soil moisture,
or at least the rate of change in soil moisture shortly after
a storm, would indicate the amount of precipitation that
was being lost to the soil. Also, the soil moisture before
a storm should indicate the capacity of the soil for
receiving the precipitation. This variable should therefore
not be excluded from future analyses simply because
this study resulted in its being deemed unimportant.

Variables deemed important in each of the three factor-
regressions deserve discussion. In the third regression,
four of the six variables chosen from the first six factors
appear in virtually all regression equations that have
been proposed for both peak discharge rate and runoff volume.
The watershed area, watershed shape, storm intensity,
and total amount of precipitation are intuitively important
and, from the results, are shown to be mathematically
important to runoff. The high importance of the maximum
ground slope, GNDS, could be attributed to two possible
explanations. Error of measurement may have caused an
"apparent" importance, because the end of the blue line on
U.S.G.S. maps may not be indicative of the actual main channel
head. When this fact is combined with the questionable
precision of contours on the maps, a strong possibility
for errors in the measurement of this variable exists.
An alternative explanation, which justifies the importance
of this variable, is the supposition that these were
"small" watersheds. Because the amount of overland flow
rather than channel flow is the criterion (Chow, p. 14-5)
for defining a small watershed, ground slope is more
important to peak discharge rates for smaller watersheds.
Neither of these reasons for the importance of GNDS in the
rotated factors could be selected as the better, and further
research is suggested for the solution.

The sixth variable in the third regression equation,
pondage, and the main channel slope were the only important
variables in the fourth rotated factor. The great
difference between the coefficients for these variables
forced the use of POND from this factor rather than S.
Had the correlation of S with this factor been larger,
the author would have used S instead of POND in the third
factor-regression, because the pondage variable was difficult
to measure. Pondage is certainly important to runoff
volumes and peak discharge rates, but the method of
obtaining this variable should create some doubt about
the high importance indicated by the analysis.

All three variables measuring the wind velocity and
direction were deemed unimportant by the analysis. The
measurements of the variables were fairly accurate, and
the indication of the rotation is probably correct. Evaporation
may not have been significant because the time
from the beginning of precipitation to the end of the runoff
hydrograph for most of the storms was relatively short.

Duration and time distribution of the storms were
other variables deemed unimportant by the analysis. The
author feels that this is probably due to the inclusion
of snowmelt events in the analysis. Because the last major
snowfall before the respective snowmelt runoff was used,
the duration was generally large for these events, and the
time distribution was generally zero. Because peak discharge
rates from "small" watersheds are highly sensitive to
short duration, high-intensity storms (Chow, 1964), the
large durations of snowfall storms probably caused the
reduction in the importance of these two variables. If
snowmelt events had been excluded, one of these variables
might have been found to be more important to runoff.

Both the variables measuring distance were deemed
unimportant by the rotation. The meander length of the
main channel, L, is usually not important for "small"
watersheds, because overland flow rather than channel
flow is more important to peak discharge rates. The correlation
of 0.80 of this variable with the fifth factor
in Table VII was relatively large for rejection, but the
relatively low importance of factor 5 to the variance makes this
correlation less important than an equal correlation in one of
the first four factors. The variable is obviously important
to the information, but the quest for as few variables
as possible for the prediction equation led to its exclusion
from the regression. The overland distance, GNDL,
was not nearly as important as the main channel distance,
which is contrary to the previous definition of "small"
watersheds. However, as with the overland slope, this
may have been difficult to measure accurately, giving a
possible reason for the apparent unimportance of GNDL.

VARIABLE INTERCORRELATIONS

The factors of Table VII indicate that certain names
may be applied to some of the factors. Also, because the
factors are uncorrelated, the important variables of one
factor are relatively uncorrelated with the variables of
another factor. The development also shows that the variables
within a factor are usually related. All of these
facts may be used to provide information about the
intercorrelations of the variables.

The first factor was highly associated with storm
variables and snowmelt variables, and might be called a
"snowmelt" factor. As shown, the duration of the storm
is correlated only with this factor, justifying the earlier
statement that the inclusion of snowmelt events affected
the duration variable. Factors 6, 7, 8, 9, and 10 could
be assigned names such as "wind," "total precipitation,"
"deviate air temperature," "storm time distribution," and
"wind direction," even though the variance of each factor
was small.

Factors 2, 3, 4, and 5 are not as easily named as the
rest. They are all highly associated with physiographic
variables only, indicating that they explain the watershed
characteristics, and that the characteristics are unrelated
to the storm factors because they are in different
factors. This simply means that knowledge of the characteristics
of a watershed usually gives no insight into the
storm characteristics.

In the second factor, the soil variables are prevalent.
The soil moisture, infiltration rate, and ground
slope could very well be related. The first has a definite
effect on infiltration rate (Chow, 1964), and the
ground slope would also reflect the amount of moisture
retained by the soil. The additional presence of azimuth
and elevation in this factor led the author to an investigation
of the soil types as the elevation changed from
watershed to watershed. Although the soil types were
not remarkably different, the infiltration rate decreased
as the elevation increased, indicating that these variables
may be related. The correlation coefficient for
the actual measurements of these variables was 0.66, and
this was among the largest values for all the correlations.
Also, the azimuth variable was highly correlated with both
the elevation and infiltration variables. It seems reasonable
to state that a North or West-sloping watershed might
contain different soils and infiltration rates than a
South or East-sloping basin. Prevailing winds, glacier
movements, or other factors could easily cause these differences.

Factor 3 indicates that the overland distance, stream
frequency, watershed shape, and land use are related.
Because the stream frequency varies approximately as the
square of the "drainage density" (Chow, 1964), which
is a measure of the closeness of channel segments, and
because the overland flow distance is inversely related
to drainage density (Chow, 1964), the correlation
of GNDL and FREQ in this factor is justified by the
literature. Also, the shape of Montana watersheds may
in fact be related to the overland slope and stream frequency,
although no statements to this effect could be
found in the literature. The presence of USE in this
factor seems to have no reasonable explanation.
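
The link between the two cited relations can be written compactly. The symbols $F$ (stream frequency), $D_d$ (drainage density), and $L_o$ (overland flow distance) are introduced here only for illustration, and the proportionality constants are left unspecified.

```latex
\begin{align*}
F &\propto D_d^{2}, &
L_o &\propto \frac{1}{D_d}
\quad\Longrightarrow\quad
F \propto \frac{1}{L_o^{2}}, \\[4pt]
\log F &= \text{constant} - 2 \log L_o .
\end{align*}
```

Under these two relations the logarithms of stream frequency and overland flow distance would plot as a straight line with negative slope, which is consistent with both variables loading on the same factor when log-transformed data are analyzed.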

The fourth factor indicates that the main channel
slope and the pondage variables are related. Although
the correlation coefficient of these variables in the
actual measurement form was small, a coefficient of -0.85
was obtained for the logarithmic forms, indicating a relatively
linear plot on logarithmic paper. The negative
correlation indicates that more ponds and reservoirs were
found on watersheds having milder slopes. This could be
attributed to the fact that the "steepest" watershed,
Hump Creek, had no visible reservoirs on the maps.
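The difference between a correlation computed on actual measurements and one computed on logarithms can be seen with a small numerical sketch. The values below are hypothetical and follow an exact power law; they are not the slope and pondage data of this study.

```python
import numpy as np

# Hypothetical slope values spanning several orders of magnitude, and a
# second variable that decreases with slope according to a power law.
slope = np.array([1.0, 2.0, 5.0, 10.0, 50.0, 100.0, 500.0])
pond = 100.0 * slope ** -1.5

# Pearson correlation of the measurements and of their logarithms.
r_raw = np.corrcoef(slope, pond)[0, 1]
r_log = np.corrcoef(np.log10(slope), np.log10(pond))[0, 1]

print(round(r_raw, 2), round(r_log, 2))
```

The relation is perfectly linear on logarithmic scales, so the correlation of the logarithms is essentially -1, while the correlation of the raw values is much weaker because the relation is strongly curved in actual measurement form.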

The close relationship of area and main channel
length is clearly indicated by the fifth factor, where
these are the only two variables highly correlated.
This relationship has been known to exist for several
years (Chow, 1964), and the rotation simply adds justification
to the relationship.

The remaining factors indicate that the average
wind velocity is related to the standard deviation of
wind velocities, which is not difficult to accept. The
relationship between total precipitation and antecedent
precipitation in the seventh factor is interesting. A
positive correlation for these variables indicates that
a "large" storm was typically preceded by a "large"
amount of precipitation in the 14 days prior to the beginning
of the storm. The actual beginning of the runoff-
producing storm was sometimes difficult to determine, and
this may be the reason for the relationship between these
variables. A runoff-producing storm was frequently observed
a day or two after another smaller rainfall which
apparently produced no runoff. The time of concentration,
or the time for rainfall at the farthest reach of the
watershed to arrive at the runoff gaging station, assuming
a uniform intensity over the watershed, was estimated
to be less than one or two days even on the largest watershed.
The storms which had no antecedent help usually produced
runoff in only a few hours, and this criterion was
used in selecting the beginning point for the precipitation
which caused the runoff.

As shown, the methods employed seem to substantiate
known relationships of certain variables. Some new
relationships for the locale of the watersheds were
discovered, and considerable insight into the importance of
some of the variables has been obtained. The "simple
structure" of the rotated factors in Table VII provided
the information discussed, and the agreements with the
literature are positive indications of the adequacy of
the methods.

REGRESSION EQUATIONS

The  coefficients  for  the  variables  in  the  regression 
equations  for  total  runoff  and  peak  discharge  rate  were 
listed  in  Table  XI  for  the  four  chosen  combinations  of 
variables.     The  dimensional  "imbalance"  of  the  equations 
is  accounted  for  by  the  constant  for  each  equation.  If 
units  other  than  those  in  the  derivation  are  used,  then 
additional  conversion  constants  must  be  added. 
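A minimal sketch of how such a conversion constant enters, assuming the equations have the multiplicative form Q = K * x1^b1 * x2^b2 ... that the references to exponents and logarithmic forms suggest. The constant, exponents, and variable names below are hypothetical and are not the values of Table XI.

```python
def predict(constant, exponents, values):
    """Evaluate Q = constant * product(x_k ** b_k) over the named variables."""
    q = constant
    for name, b in exponents.items():
        q *= values[name] ** b
    return q

def convert_constant(constant, exponents, factors):
    """Adjust the constant when some variables are supplied in new units.

    factors[name] converts a value in the new units back to the units
    used in the derivation (for example 1/640 for acres -> sq mi).
    """
    for name, f in factors.items():
        constant *= f ** exponents[name]
    return constant

# Hypothetical constant and exponents, for illustration only.
K = 12.0
b = {"A": 0.6, "I": 1.1}

q_sq_mi = predict(K, b, {"A": 2.0, "I": 0.8})            # area in sq mi
K_acres = convert_constant(K, b, {"A": 1.0 / 640.0})     # 640 acres = 1 sq mi
q_acres = predict(K_acres, b, {"A": 1280.0, "I": 0.8})   # same area in acres
```

Both calls describe the same watershed and storm, so the two predictions agree once the constant has absorbed the unit change.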

Peak  Discharge  Rate 

In the first equation for peak discharge rate, the
coefficients on infiltration rate, both slopes, soil
moisture, snow-water equivalent, air temperature, degree
days, and stream frequency were negative, indicating that
the peak discharge rate increases as these variables
decrease. It would seem that a larger slope should produce
larger peak discharge rates, because the velocity of
overland and channel flow is directly related to slope. Because
the peak discharge rates were measured in the channel, the
size, shape, and roughness of the channel are also
determinants, and the slope cannot be considered alone. Also,
soil moisture and stream frequency are variables which
should vary directly with peak discharge rate. Negative
coefficients on "obvious" variables such as area, intensity,
or total precipitation were not present, and this indicates
that "reasonable" coefficients were obtained for most of
the variables. The positive coefficient for pondage is
questionable, because reservoirs are generally installed to
reduce peak discharge rates. If measurements of all the
variables in this equation are available, then it should
give the best prediction for peak discharge rate because it
explains 93.4 per cent of the variation in the original
data, regardless of the respective signs of the exponents.

The three factor-regression equations for peak
discharge rate each had some intuitively incorrect signs
for the exponents. All three indicated that total
precipitation is inversely related to peak discharge rates,
and only two indicated a direct relation to intensity.
This would mean that short-duration, high-intensity storms
produced the greatest discharge rates, confirming the
statement that these were "small" watersheds. The negative
coefficient for total precipitation seems intuitively
incorrect, but the definition of small watersheds allows an
inverse relationship of peak discharge rate with duration,
and probably with total precipitation because this generally
increases with duration.

As in the principal component regression equation,
the pondage variable for the last three equations was
directly related to peak discharge rate. The reason for
this apparent error is unexplained, and deserves further
investigation if the signs of the coefficients are accepted
as being correct.

The area was found to be directly related to peak
discharge rate in all the equations, agreeing with the
Rational equation for runoff. However, the ground slope
coefficient was positive only in the last equation, and the
lack of information about the channel shape and roughness
might have been at fault here. Also, this may have been
caused by the fact that few runoff events with large peak
discharge rates were included for the "steepest"
watershed, Hump Creek.

Other coefficients for variables in the equations
for peak discharge rate could not be satisfactorily
evaluated, because too little was known about their physical
influence on discharge. The peculiarities of the
research watersheds and measured storms may have resulted
in some of the incorrect signs. The equations are not as
sensible as expected, but the theory states that because
they were obtained from uncorrelated variates, they at
least satisfy regression requirements.
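For reference, the Rational equation mentioned above is commonly written Q = CiA, with Q in cfs, i in inches per hour, and A in acres. The sketch below uses a hypothetical runoff coefficient and intensity and shows only the direct proportionality to area; it is not one of the regression equations of this study.

```python
ACRES_PER_SQ_MI = 640.0

def rational_peak_cfs(c, intensity_in_per_hr, area_sq_mi):
    """Peak discharge Q = C * i * A, with A converted from sq mi to acres."""
    return c * intensity_in_per_hr * area_sq_mi * ACRES_PER_SQ_MI

# Hypothetical inputs: the same storm on two watershed sizes.
q_small = rational_peak_cfs(0.3, 1.5, 2.0)
q_large = rational_peak_cfs(0.3, 1.5, 4.0)   # twice the area
```

Doubling the area doubles the computed peak, which is the sense in which a positive area coefficient agrees with the Rational equation.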

Total Runoff

The equations for total runoff also exhibited
intuitively incorrect signs for a few coefficients. The most
obvious was the inverse relationship of total runoff and
watershed area in all the equations. This indicates that
larger runoff volumes were produced from smaller
watersheds, and may be due to the fact that the largest
runoff volume occurred from a storm on Hump Creek. This
storm had a particularly long duration with a small peak
discharge rate and a large total volume of runoff. The
possible influence of one storm on the final equations
indicates that the method might be overly sensitive to
individual storms, but this statement was not supported by
the literature. Further studies are suggested if these
methods are to be employed in future regression analyses.
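The possible sensitivity to a single storm can be examined by refitting with each event left out in turn. The following is a minimal sketch for a simple two-variable regression on hypothetical logarithmic data (not the thesis data), in which one large-volume event on the smallest watershed is enough to reverse the sign of the area coefficient.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept of ys on xs."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

def leave_one_out_slopes(xs, ys):
    """Slope obtained when each observation is omitted in turn."""
    return [fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])[0]
            for i in range(len(xs))]

# Hypothetical log10 values; the first event is the unusual storm.
log_area = [0.2, 0.9, 1.0, 1.1, 1.2, 1.3]
log_runoff = [0.5, -1.0, -0.9, -0.8, -0.7, -0.6]

full_slope = fit_line(log_area, log_runoff)[0]
loo = leave_one_out_slopes(log_area, log_runoff)
```

Here the full fit gives a negative area coefficient, while omitting the first event alone restores a positive one; omitting any other event leaves the sign unchanged.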

The ground slope was inversely related to runoff volume
for the first three equations, and this might be viewed as
incorrect, because the milder slopes should retain more
precipitation, thereby reducing the total runoff. However,
infiltration rates and antecedent soil-moisture conditions
must also be considered. The slope cannot be separated
from other variables in determining the correct
relationship with runoff, and the signs obtained may be correct.

Another possible intuitive error in the equations for
runoff lies in the signs for the coefficients of intensity of
precipitation, which are negative for all equations. High
intensity storms should reasonably produce large runoff
volumes; however, other variables such as duration and
the snowmelt variables must also be considered. The
low-intensity snow storms with long durations could have
preceded large runoff events, giving a possible explanation
for the inverse relationship.

The total precipitation should obviously be directly
related to total runoff, and this was found to be true in
the first three equations. The negative coefficient in
the fourth equation should definitely be questioned for
this variable. Large snowmelt-runoff volumes for
relatively small amounts of total precipitation could have
created this error, but this should have caused negative
signs in all the equations and may not be a reasonable
explanation.

In summary, it would seem that the equations for
peak discharge rate and total runoff were theoretically
accurate, but the sensibility of the coefficients was not
as consistent as the literature had predicted. Most of the
incorrect signs could be justified by stating that other
variables or watershed and storm peculiarities were
responsible, but some coefficients could not reasonably be placed
in this category. The suggestions for the limitations of
these equations in the next section should provide caution
in any predictions.

LIMITATIONS  OF  RESULTS 

Statistical conclusions from four years of
information on five watersheds may not be reliable. In general,
hydrologists prefer a much longer data period. However,
no frequency studies were made in this report, and the
author felt that the size-range of storms and floods was well
represented by the data. Discharge rates ranging from 10
to 1720 cfs, and runoff volumes between 0.006 and 1.806
inches were present. Because no frequency analyses were
performed, and because the mechanics of the variables was
the prime objective of this study, sufficient data was
felt to be present. The measurements of the variables
were made during each runoff event, and measurements of
the mechanics of the variables were therefore taken. Had
only a single runoff event been observed, then the rotation
method should still have indicated the important variables
for the storm, and the regression analysis should have
yielded an equation which predicts that particular storm.
The use of 50 runoff events simply extends the information
about the mechanics of the variables. Recognition of the
mechanics from the data is attempted in this report, but is
not yet complete.

The regression equations are probably not as
reliable as the information about the variables. Long term
variables such as annual snow fall, solar radiation,
prevailing winds, or watershed "treatments" such as
conservation practices, crop rotation, etc., could not be
measured in the data period. Prediction with the
equations should probably be confined to storms occurring
within a time period which is not affected by the long
term variables. This period remains to be determined.

With the above limitations, the equations should
be suitable for use on ungaged watersheds in the locale
of the research watersheds. Measurements or estimations
of the independent variables must be made, and some of
these may be difficult to obtain, particularly in the
longer equations. The variables in the last equation can
be obtained from contoured topographic maps and from
estimations of the storm intensity and volume. Design storms
will require estimates of the T-year intensities and
volumes, which would hopefully give the T-year peak
discharge rates and runoff volumes. This is conjecture at
this point, and frequency analyses of the data would have
to be made before this relationship could be used.

Use of values of the independent variables "outside"
the range of each variable is usually discouraged in
linear regression equations. The reasoning here is that the
equation predicts a line, plane, or hyperplane whose
boundaries are the limits of the measured independent variables.
For example, a simple linear regression for one variable
in terms of another would yield a straight line through
the data points on a plot of the two variables. If only
a small range of values for the variables was used, the
line may be only a segment of a larger line, which may be
curvilinear. Prediction outside the range of the variables
would yield a point on the extended line, but this line may
not represent the total curve. The actual and transformed
variable means and standard deviations listed in Appendix
H may be used as a guide in determining the range for each
variable.
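One way such a guide might be applied is sketched below. The means, standard deviations, variable names, and the choice of two standard deviations as the limit are all hypothetical assumptions for illustration; the actual values are those listed in Appendix H, and the same check could be made on the transformed (logarithmic) values.

```python
def outside_range(values, stats, k=2.0):
    """Return the names of variables lying outside mean +/- k standard deviations.

    values: {name: value proposed for prediction}
    stats:  {name: (mean, standard_deviation)} from the derivation data
    k:      assumed multiple of the standard deviation taken as the limit
    """
    flagged = []
    for name, x in values.items():
        mean, sd = stats[name]
        if abs(x - mean) > k * sd:
            flagged.append(name)
    return flagged

# Hypothetical statistics and a hypothetical design storm.
stats = {"A": (12.0, 6.0), "I": (0.9, 0.4)}
design = {"A": 15.0, "I": 2.5}

flagged = outside_range(design, stats)
```

In this example the area lies within the assumed limits but the intensity does not, so a prediction for this storm would be an extrapolation in intensity.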

Chapter VI

CONCLUSIONS AND RECOMMENDATIONS

CONCLUSIONS 

Two objectives for this investigation were presented
in Chapter I. The first was the determination of which
of the 29 independent variables were more important to
the dependent variables: peak discharge rate and runoff
volume from small, central and eastern Montana watersheds.
The second objective was the derivation of regression
equations for the dependent variables in terms of the important
independent variables.

It is believed that the analyses adequately fulfilled
the first objective. The variables which contributed most
to the peak discharge rates and runoff volumes were in
general agreement with the most important variables listed
in the literature. The following variables: precipitation
intensity, standard deviation of precipitation intensities,
soil and air temperature, watershed azimuth, overland slope,
watershed shape, reservoir area, and watershed area were
among the most successively important independent variables.
The variables related to snowmelt runoff events were also
important, indicating that some variables were important
or unimportant because snowmelt runoff events were included
in the investigation.

The principal-component and rotated-factor regression
equations for the peak discharge rate and runoff volume
from the watersheds were not as consistent as expected.
The equations were derived from independent components and
factors, and therefore satisfy multiple regression
assumptions. The equations for the dependent variables in
terms of 29 independent variables exhibited the most
intuitively correct relationships among the variables. When
variables were discarded for the rotated-factor regression
equations, some relationships of dependent and independent
variables became intuitively incorrect, especially when
only six independent variables were used.

RECOMMENDATIONS FOR FUTURE RESEARCH

Because Wallis (1968) published his paper after the
present investigation, some of his suggestions were not
included herein, and are presented as recommendations for
continued research. The following analyses are suggested
to comply with his recommendations: From the rotated
factors of Table VII, select only two variables from the
first and second factors, and one variable from the third,
fourth, fifth, and seventh factors, yielding eight
independent variables, I, AIRT, AZ, GNDS, SHP, PC^m, A, and
TPCP, respectively. Perform a principal component
analysis of the correlations of only these variables, compute
the principal factors, and rotate the factors. If any of
the variables are unimportant to all the rotated factors,
repeat the analysis for the important variables. If all
the factors are important, compute the principal component
regression equation for these variables. This analysis may
or may not yield a better prediction equation than any of
those included, but the results should prove interesting.

Because the inclusion of snowmelt runoff events in the
present investigation caused some of the incorrect signs
in the regression equations, the separation of snowmelt and
rainfall-produced runoff events is recommended in future
analyses. The methods herein are suggested for use in
determining important variables to each type of runoff
event.

The information about the important variables and
related variables could be used in any future study.
Although the termination of the measurements of the
unimportant variables is not recommended, the author does suggest
that concern for precise measurements of many variables
may be "relaxed" in favor of better measurements of the
important variables. The amount of data used and the use
of this single analysis indicate that absolute
determinations of totally unimportant variables are not contained
herein. This report should serve as a guide for any
future analyses which question the relative importance of
certain variables, the possible relationships of certain
variables with each other, or the prediction equations for
the dependent variables.

SUMMARY

The analysis of peak discharge rates and runoff volumes
from five central and eastern Montana watersheds for the
important causative factors and the principal component
and factor regression equations is contained in this
report. Prediction of peak discharge rates and runoff volumes
for ungaged watersheds in the locale of the research
watersheds is recommended with the developed equations if certain
limitations are recognized. The results obtained and the
agreements with the literature suggest that multivariate
statistical methods are suitable for use in
"multi-variable" hydrologic studies for obtaining information about
the relative importance of each of the variables,
information about the intercorrelations of the variables, and
satisfactory regression equations for the runoff variables.

APPENDIX

APPENDIX A

Graphical  Derivation  of  Principal  Component  Theory 

The procedure for finding the $l_{kj}$ in Equation (4)
can be viewed graphically to give a physical concept
of variance. From Equation (4), the jth component is
seen to be linearly related to the standardized variables.
If the variables are considered as axes in a p-dimensional
space, then any principal component is simply a line through
this set of axes. One equation of a line in p dimensions
is (Kendall, 1957)

$$\frac{x_1 - m_1}{l_1} = \frac{x_2 - m_2}{l_2} = \cdots = \frac{x_p - m_p}{l_p} \tag{A1}$$

where the $m_k$ are intercepts with the $x_k$ axes, and the $l_k$
are the direction cosines. A two-dimensional example of
this form is shown in Figure A1, where $l_1$ and $l_2$ are the
cosines of angles $\theta_1$ and $\theta_2$. The notation for direction
cosines, $l$, and for the coefficients in Equation (4) is
the same because these values are later shown to be the
same.

Figure A1. Two-dimensional Principal Component

The measurements of the variables, after being
standardized, could be plotted as points in the p-dimensional
space, just as points $P_i$ are shown in Figure A1. Finding
the component which reproduces a maximum amount of the
information is identical to drawing a line through the
points in a manner which minimizes the perpendicular
distances of the points to the line. The first impulse is
to write an equation for the sum of the distances and
then minimize this sum. However, some of the distances
are negative, and cancellation effects are encountered.
To rectify this, the distances can be squared, eliminating
the negative signs, and the sum of squared distances can
be minimized, which is simply least-squares theory.
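The least-squares idea can be sketched numerically in two dimensions. The points below are hypothetical and are only centered on the origin (not fully standardized); the line is assumed to pass through the origin with direction cosines (cos theta, sin theta), and the one-degree search over angles is an illustrative shortcut rather than the method of this study.

```python
import math

def sum_sq_perp(points, theta):
    """Sum of squared perpendicular distances from the points to a line
    through the origin with direction cosines (cos(theta), sin(theta))."""
    l1, l2 = math.cos(theta), math.sin(theta)
    return sum((l2 * x1 - l1 * x2) ** 2 for x1, x2 in points)

def sum_sq_proj(points, theta):
    """Sum of squared projections of the points onto the same line."""
    l1, l2 = math.cos(theta), math.sin(theta)
    return sum((l1 * x1 + l2 * x2) ** 2 for x1, x2 in points)

# Hypothetical centered observations of two variables.
points = [(-1.5, -1.0), (-0.5, -1.0), (0.0, 0.5), (0.5, 0.0), (1.5, 1.5)]

# Search line orientations at one-degree steps.
angles = [math.radians(a) for a in range(180)]
best = min(angles, key=lambda t: sum_sq_perp(points, t))
best_deg = round(math.degrees(best))
```

Because the squared distance of each point from the origin is fixed, the line that minimizes the squared perpendicular distances is also the line that maximizes the squared projections, that is, the variance reproduced along the line.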

Mathematics textbooks give the equation for the
distance from point $P_i$ to a line in two dimensions as

$$d_i = \frac{A x_{1i} + B x_{2i} - C}{\sqrt{A^2 + B^2}}$$

if the line has the equation

$$A x_1 + B x_2 = C \tag{A2}$$

The equation chosen for the line in Figure A1 reduces to

$$l_2 x_1 - l_1 x_2 = l_2 m_1 - l_1 m_2$$

which has the form of Equation (A2). Substituting for A,
B, and C, and squaring $d_i$ gives

$$d_i^2 = \sum_{k=1}^{2} (x_{ki} - m_k)^2 - \left[ \sum_{k=1}^{2} l_k (x_{ki} - m_k) \right]^2$$

Since p dimensions instead of two are used in this study,
then the squared distance from the ith point to the line
of Equation (A1) is

$$d_i^2 = \sum_{k=1}^{p} (x_{ki} - m_k)^2 - \left[ \sum_{k=1}^{p} l_k (x_{ki} - m_k) \right]^2$$

and the sum of all squared distances for the n observations
is

$$\sum_{i=1}^{n} d_i^2 = nS = \sum_{i=1}^{n} \left\{ \sum_{k=1}^{p} (x_{ki} - m_k)^2 - \left[ \sum_{k=1}^{p} l_k (x_{ki} - m_k) \right]^2 \right\} \tag{A3}$$

which is the form given by Kendall (1957, p. 14). It is
desired to find the $m_k$ and $l_k$ which minimize this sum. Upon
equating the partial differentials of this equation with
respect to the $m_k$ to zero, Kendall shows that the line
passes through the origin of the standardized variables,
and the $m_k$ are all zero. Before partial differentiation
of Equation (A3) with respect to $l_k$, the condition that

$$\sum_{k=1}^{p} l_k^2 = 1 \tag{A4}$$

must be included, since the squares of the direction
cosines of any line must sum to unity. Although Kendall
does not show it, he adds the value

$$nL \left( \sum_{k=1}^{p} l_k^2 - 1 \right) = 0 \tag{A5}$$

to Equation (A3) before differentiating. This is done
to assure the investigator that Equation (A4) will apply
when the values of $l$ are found. The value L (the
eigenvalue) is a constant that may be zero, but it must be
considered unless it is found later to be zero. Dropping
$m_k$ in Equation (A3), adding Equation (A5), and
differentiating with respect to $l_j$ gives the set of equations

$$\frac{1}{n} \sum_{k=1}^{p} l_k \sum_{i=1}^{n} x_{ki} x_{ji} - L\, l_j = 0; \quad j = 1, 2, \ldots, p \tag{A6}$$

Since the correlation between two standardized variables
is (Harman, 1967, p. 13)

$$r_{jk} = \frac{1}{n} \sum_{i=1}^{n} x_{ji} x_{ki} \tag{A7}$$

then Equations (A6) can be written as

$$\sum_{k=1}^{p} l_k r_{jk} - L\, l_j = 0; \quad j = 1, 2, \ldots, p \tag{A8}$$

Noting that the correlation of a variable with itself is
unity, Equations (A8) can be expanded to

$$\begin{aligned}
l_1 (1 - L) + l_2 r_{12} + \cdots + l_p r_{1p} &= 0 \\
l_1 r_{21} + l_2 (1 - L) + \cdots + l_p r_{2p} &= 0 \\
&\;\;\vdots \\
l_1 r_{p1} + l_2 r_{p2} + \cdots + l_p (1 - L) &= 0
\end{aligned}$$

where the $l_k$ and L are to be determined. These equations
can be written in matrix form as

$$[R - LI]\,[l] = 0$$

where $[l]$ is a vector of direction cosines. If all
direction cosines were zero, then all the angles between
the desired line and the $x_k$ axes would be right angles.
This means that the line would be perpendicular to all
the axes. Since this is not likely, the l-vector can be
eliminated giving the matrix of Equation (10) in Chapter
III.
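A solution of the matrix equation for the largest eigenvalue can be sketched with power iteration. This numerical technique, the function name, and the correlation matrix below are assumptions for illustration only; the computational procedure actually used in this study is not stated here.

```python
import math

def first_component(R, iterations=200):
    """Largest eigenvalue L and unit eigenvector of a symmetric correlation
    matrix R, found by power iteration; the pair satisfies (R - L I) l = 0."""
    p = len(R)
    cosines = [1.0 / (k + 1) for k in range(p)]   # arbitrary starting vector
    L = 0.0
    for _ in range(iterations):
        y = [sum(R[j][k] * cosines[k] for k in range(p)) for j in range(p)]
        L = math.sqrt(sum(v * v for v in y))      # length of R times l
        cosines = [v / L for v in y]              # rescale to unit length
    return L, cosines

# Hypothetical correlation matrix of three standardized variables.
R = [[1.0, 0.8, 0.3],
     [0.8, 1.0, 0.2],
     [0.3, 0.2, 1.0]]

L, cosines = first_component(R)
residual = max(abs(sum(R[j][k] * cosines[k] for k in range(3)) - L * cosines[j])
               for j in range(3))
```

The returned direction cosines have squares summing to unity, as the constraint on the direction cosines requires, and the residual of each expanded equation is essentially zero. For two variables with correlation r, the same routine returns L = 1 + r with equal direction cosines.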

APPENDIX B

Graphical Derivation of Varimax Rotation Theory

Graphically, the principal components are reference
axes for the independent variables. If two components
and two independent variables were present, then the
components could be represented by the axes $V_1$ and $V_2$
in Figure B1. Since the components are linear equations
involving the standardized independent variables, as
shown in Equation (4), then the variables can also be
written in terms of the s important components (Kendall,
p. 16). The variable equations are

$$x_k = \sum_{j=1}^{s} l_{kj} V_j \tag{B1}$$

where the terms are the same as for Equation (4). If
the $V_j$ are considered to be directional unit vectors, then
the $l_{kj}$ are projections of the "variable" vectors, $x_k$,
onto the $V_j$ axes, and Equation (B1) defines the position
vector for variable $x_k$ in the reference frame of Figure B1.

Figure B1. Graphical Representation of Principal Components

The purpose of rotation is to rotate the axes $V_1$
and $V_2$ through an angle $\theta$ to the new positions, $V_1'$ and
$V_2'$, so that the projections onto the new axes are as
large as possible. This means that the new components
will have either large or small coefficients on the
variables, and better interpretations of the importance of
the variables will be possible. Coefficients similar to
those of the original components (i.e., coefficients that
are large for all the variables) will still be present,
but if large coefficients are maximized, some small
coefficients will be present in the rotated components.
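The effect of rotating the reference axes on the projections can be sketched for two components. The coefficients below are hypothetical, and the 45-degree angle is chosen by inspection rather than by any maximizing criterion.

```python
import math

def rotate_projections(vectors, theta):
    """Projections of each variable vector onto axes rotated
    counterclockwise through the angle theta (radians)."""
    c, s = math.cos(theta), math.sin(theta)
    return [(a1 * c + a2 * s, -a1 * s + a2 * c) for a1, a2 in vectors]

# Hypothetical coefficients of two variables on two components:
# both variables have sizeable coefficients on both axes.
vectors = [(0.6, 0.6), (0.7, -0.7)]

rotated = rotate_projections(vectors, math.radians(45.0))
```

After the rotation each variable has one large and one (nearly) zero projection, while the length of each variable vector is unchanged; only the reference frame has moved.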

Kaiser obtained his best interpretations from "normal
varimax rotation" (1959). In order to use his methods,
the components must first be converted to factors by
multiplying the coefficients of each component by the
component's eigenvalue, as shown by Equation (12), or


A. 
J 


(12) 


which gives the jth factor the equation


P 


if Equation (4) is substituted for $V_j$. (Baggaley (1964,
p. 256) was concerned with the effect of this modification
of the components on the interpretation of the variables,
and found that nothing was changed. The variance of a
factor is the same as the variance of a component (Harman,
p. 16), and the contribution of both to the total variance
is the same.)

After the factors are found, the variable vectors
are given unit length by dividing each coefficient by
the length of the vector. The length is simply the square
root of the sum of squared coefficients. The new
"normalized" variable vectors are therefore defined by


s 


(B2)


for the s components, where

and

Figure B2. Normalized Variable Vectors Plotted in
a Factor Reference Frame

This procedure results in variable vectors having
unit lengths and plotted in a reference frame with
factors for axes, as shown in Figure B2. The desired
angle of rotation is still $\theta$, and the projections to be
maximized are now the $b_{kj}$, the projections on the rotated
factors. As with the principal component derivation in
Figure A1, some of the projections may be negative, and
squaring eliminates the minus signs. However, the sum
of these squared projections is not the variance of a
factor, which is to be maximized. Graphically, the
variance of a factor is found by subtracting the average
squared projection on the jth factor from each squared
projection, and then squaring this difference to eliminate
minus signs. The average squared projection on the jth
factor is given by

$$d_j = \frac{1}{p} \sum_{k=1}^{p} b_{kj}^2 \tag{B3}$$

if there are p variable vectors. Subtracting this from
each squared projection on the jth factor, squaring the
difference, and summing for all factors gives the sum

$$W = \sum_{j=1}^{s} \sum_{k=1}^{p} \left( b_{kj}^2 - d_j \right)^2 \tag{B4}$$

This is the sum of all the squared differences for the
p variables and s factors, and it represents the sum of
squared projections on the rotated factors. If this sum
is divided by p, then an "average" squared projection on
the factors is approximated. Kaiser defines the variance
of a factor as this "average", and his equation to be
maximized is given (1959) by Equation (15) in Chapter III.
Equation (15) is obtained by dividing Equation (B4) by
p, and expanding the squared term.
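The sum W described above, and the expanded form obtained on dividing by p, can be sketched as follows. The two sets of normalized projections are hypothetical: one has a "simple structure" and the other is the same set of vectors viewed from axes turned through 45 degrees. The function names are illustrative only.

```python
def varimax_sum(b):
    """Sum over factors and variables of the squared differences between
    each squared projection and the average squared projection on that factor."""
    p, s = len(b), len(b[0])
    w = 0.0
    for j in range(s):
        d_j = sum(row[j] ** 2 for row in b) / p
        w += sum((row[j] ** 2 - d_j) ** 2 for row in b)
    return w

def varimax_variance(b):
    """Expanded form of varimax_sum(b) / p: mean fourth power minus the
    square of the mean square, summed over the factors."""
    p, s = len(b), len(b[0])
    v = 0.0
    for j in range(s):
        mean_fourth = sum(row[j] ** 4 for row in b) / p
        mean_square = sum(row[j] ** 2 for row in b) / p
        v += mean_fourth - mean_square ** 2
    return v

r = 0.5 ** 0.5
# Hypothetical unit-length variable vectors on two factors.
simple = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
mixed = [[r, -r], [r, -r], [r, r], [r, r]]   # same vectors, axes turned 45 degrees

w_simple = varimax_sum(simple)
w_mixed = varimax_sum(mixed)
```

The simple structure gives the larger value of the criterion, while the frame in which every variable projects equally on both factors gives a value of essentially zero, which is why maximizing this quantity drives the projections toward either large or small values.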

After the $b_{kj}$ which maximize Equation (15) are found,
the coefficients of each final variable vector are
multiplied by the lengths of the original vectors to place
them back in the same relative perspective before they
were normalized, rather than allowing each to have unit
length (Kaiser, 1959). The rotated factors therefore
have the final equations

$$A_{ji} = \sum_{k=1}^{p} c_{kj} x_{ki} \tag{B5}$$

where

$$c_{kj} = h_k b_{kj}$$

and the $c_{kj}$ provide the information for interpretation
of the importance of the variables to the rotated factors,
and therefore to the information present in the data. The
$c_{kj}$ values of Equation (22) are the same as these $c_{kj}$.

APPENDIX C

Descriptions of Independent Variables

The variable numbers, names, and definitions listed
below describe the 29 independent variables of Table II
in Chapter IV. Each definition gives the units of the
variable in parentheses, the methods of obtaining each
variable, and the source of the data for each measurement
of each variable.


1,  A       :     Area  of  watershed  (sq  mi)   from  SG3  soil  ma/p 

(scale:     ^■  in,  =  1  mi), 

2,  SHP  :     Shape  of  watershed  (dimensionless )  from  the 

ratio  of  the  length  of  the  longest  line  that 
could  be  passed  through  the  center-of -area  of 
the  watershed  to  the  length  of  a  perpendicular 
through  this  point.     Center-of -area  found  from 
the  balance  point  of  a  cut-out  of  the  water- 
shed from  3CS  soil  map. 

3.  AZ   :  Azimuth of main channel (deg) measured clockwise
            from true North when referred to the
            water-stage recorder. Mean azimuth of straight
            line sections of main channel computed from

            AZ = \frac{\sum_j (AZ_j L_j)}{\sum_j L_j}

            where

            AZ_j = azimuth of each straight-line
                   segment, starting at gaging station
                   and ending at closest point on basin
                   divide to end of main channel on
                   SCS soil map,

            and

            L_j = length of each straight-line segment.


4.  ELEV :  Elevation of watershed (ft) computed from an
            average of the elevations at 0.2 and 0.8 of
            the main channel straight-line-segment length
            from the gaging station to the nearest point
            of basin divide from head of main channel.
            Elevations from contoured topographic map and
            distances from chartometer of SCS soil map.

5.  GNDS :  Maximum ground slope (ft/ft) computed by dividing
            the distance from the head of the blue,
            main-channel line on USGS Quadrangle map to the
            nearest point on the basin divide, by the
            change in elevation between these points.

6.  GNDL :  Overland distance (mi) of flow obtained from
            distance from head of blue, main-channel line
            on USGS Quadrangle map to nearest point on
            basin divide.

7.  FREQ :  Stream frequency (1/sq mi), or number of stream
            segments per unit area. Obtained by dividing
            the total number of stream segments of all
            orders by the area of the watershed. Orders
            of stream segments determined by numbering
            heads of tributaries 1st order, and having
            two 1st order streams joining to a 2nd order,
            etc. Count taken from SCS soil map.

8.  S    :  Main channel slope (ft/ft) computed as the
            slope of a straight line drawn through the
            main channel straight-line-segment profile.
            The straight line is fitted to the profile
            in such a way as to give the same area under
            the line as the total area under the profile,
            using equal base lengths. Profile from contoured
            Quadrangle maps (for Duck and E. F. Duck
            Creek watersheds, profile from transit-stadia
            surveys).
9.  L    :  Main channel meander length (mi) from chartometer
            of SCS soil map main channel (largest
            order stream), including distance from end of
            blue line to nearest point on basin divide.

10. USE  :  Land use ratio (dimensionless) obtained from
            a ratio of the vegetated (including forests for
            Hump Creek) area to the bare-cultivated (summer
            fallow) area. Relative areas were determined
            subjectively from each section in the
            aerial photographs.

11. INFR :  Infiltration rate (in./hr) of watershed, weighted
            for rates of different soil types. Computed
            from,

            INFR = \frac{\sum_j (I_j A_j)}{\sum_j A_j}

            where,

            I_j = average measured infiltration rate
                  on each soil type of watershed.
                  If not measured, soil was classified
                  by SCS method (type A, B,
                  C, or D), and the average for all
                  watersheds was applied to any
                  soil types which were not tested,

            and,

            A_j = area of each soil type on watershed
                  from planimeter of SCS soil map.

12. POND :  Distance-weighted per cent of total area in
            ponds and reservoirs (%) computed from,

            POND = \frac{\sum_j (L_j AP_j)}{A \sum_j L_j} (100)

            where,

            L_j = straight-line-segment distance
                  along main channel from gaging
                  station to each pond on SCS soil
                  map,

            AP_j = full surface area of each pond or
                   reservoir from planimeter of SCS
                   soil map,

            and,

            A = area of watershed.
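Several of the watershed variables above are weighted averages over channel segments, soil types, or ponds. A minimal Python sketch of that pattern follows. The function names and sample numbers are hypothetical, and the weighted-mean reading of the azimuth and infiltration definitions is an assumption.

```python
def weighted_mean(values, weights):
    """Weighted mean, e.g. azimuths weighted by segment length."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(v * w for v, w in zip(values, weights)) / total

def pond_percent(distances, pond_areas, watershed_area):
    """Distance-weighted per cent of watershed area in ponds (POND)."""
    weighted_area = sum(l * ap for l, ap in zip(distances, pond_areas))
    return 100.0 * weighted_area / (watershed_area * sum(distances))

# Hypothetical main channel: 1 mi at 30 deg, then 3 mi at 60 deg.
az = weighted_mean([30.0, 60.0], [1.0, 3.0])

# Hypothetical ponds 1 mi and 3 mi upstream on a 5 sq mi watershed.
pond = pond_percent([1.0, 3.0], [0.02, 0.01], 5.0)
```

With these numbers the mean azimuth is 52.5 deg and POND is 0.25 per cent. The distance weighting in POND counts a pond far from the gaging station more heavily than one of the same size near it.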


Storm Variables


13. I    :  Intensity of precipitation (in./hr) computed
            as mean of all hourly intensities from all
            recording precipitation stations from beginning
            of storm on the first recording station to
            hour of last runoff from hydrograph. Beginning
            of storm determined from precipitation
            hyetographs. (Last major snowfall used for
            snowmelt runoff events.)

14. ISD  :  Standard deviation of precipitation intensities
            used in variable 13, computed from,

            (ISD)^2 = \frac{1}{n-1} \left[ \sum_i I_i^2 - \frac{(\sum_i I_i)^2}{n} \right]

            where,

            n = total number of hourly intensities
                recorded for all recording precipitation
                stations.


15. D    :  Duration of storm (hr) computed as an average
            time to the center of area of rainfall hyetographs
            for all recording precipitation stations,
            computed from,

            D = \frac{\sum_i (DCA_i A_i)}{\sum_i A_i}

            and,

            DCA_i = \frac{\sum_j (T_j I_j)}{\sum_j I_j}

            where,

            DCA_i = time in hours from beginning of
                    each rainfall hyetograph to the
                    center of area of each hyetograph,
                    where hyetograph ends with last
                    precipitation before hour of end
                    of runoff,

            A_i = Thiessen area of each recording
                  precipitation station from planimeter
                  of SCS soil map,

            T_j = time in hours from beginning of
                  hyetograph to the hour of I_j, where,

            I_j = hourly precipitation at the respective
                  recording station.
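The two steps in the definition of D (a center of area for each hyetograph, then a Thiessen-weighted average over stations) can be sketched as follows. The names and data are hypothetical, and the sketch assumes time is counted to the end of each hour, starting at 1 for the first hour of the hyetograph.

```python
def center_of_area(hourly_precip):
    """Time (hr) to the center of area of one hyetograph."""
    total = sum(hourly_precip)
    if total <= 0:
        raise ValueError("hyetograph has no precipitation")
    moment = sum(t * p for t, p in enumerate(hourly_precip, start=1))
    return moment / total

def storm_duration(hyetographs, thiessen_areas):
    """Thiessen-weighted mean of the station centers of area (D)."""
    centers = [center_of_area(h) for h in hyetographs]
    weighted = sum(c * a for c, a in zip(centers, thiessen_areas))
    return weighted / sum(thiessen_areas)

# Two hypothetical recording stations with hourly precipitation (in.).
station_a = [0.1, 0.3, 0.0]
station_b = [0.0, 0.2, 0.2]
d = storm_duration([station_a, station_b], [2.0, 1.0])
```

Here the station centers are 1.75 hr and 2.5 hr, and the weighted result is 2.0 hr. Measuring each hyetograph from the start of the first appearing hyetograph instead gives D1 in the time distribution factor.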

16. TDF  :  Time distribution factor (hr) of precipitation
            over watershed, computed as,

            TDF = D1 - D

            where,

            D1 = \frac{\sum_i (DCA_i A_i)}{\sum_i A_i}

            and,

            DCA_i = time in hours from beginning of
                    first appearing rainfall hyetograph
                    to the center of area of each
                    hyetograph,

            and,

            A_i = Thiessen area of each recording
                  precipitation station from planimeter
                  of SCS soil map.
17. TPCP :  Total precipitation (in.) computed as an
            average Thiessen-weighted precipitation from
            all precipitation stations, or,

            TPCP = \frac{\sum_i (A_i P_i)}{\sum_i A_i}

            where,

            P_i = total storm precipitation at each
                  recording and non-recording precipitation
                  station from beginning of
                  storm to end of runoff,

            and,

            A_i = Thiessen area of each station from
                  planimeter of SCS soil map.

18. API  :  Fourteen-day antecedent precipitation index
            (in.) computed from,

            API_i = k (API_{i-1}) + P_i

            where,

            API_i = antecedent precipitation index for
                    the ith day after the fourteenth
                    day before the first day of the
                    storm,

            k = 0.78, a reduction constant to
                indicate the evaporation-transpiration
                losses from day to day.
                (Usually ranges from 0.80 to 0.95
                (Chow, p. 25-102), and small value
                used here for relatively dry watersheds.),

            P_i = Thiessen-weighted average daily
                  precipitation for each station,

            and,

            API_{14} = API for first day of storm.
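The API recursion is easy to trace by hand or in code. The sketch below is a hedged illustration: the function name and rainfall series are hypothetical, and starting the index at zero on day 0 is an assumption the text does not state.

```python
def antecedent_precip_index(daily_precip, k=0.78, api_start=0.0):
    """Apply API_i = k * API_(i-1) + P_i over a run of daily totals."""
    api = api_start
    for p in daily_precip:
        api = k * api + p
    return api

# Hypothetical 14 days before the storm: 1.0 in. on day 1, then dry.
dry_spell = [1.0] + [0.0] * 13
api_14 = antecedent_precip_index(dry_spell)

# Three-day trace: 1.0 -> 0.78 -> 0.6084
api_3 = antecedent_precip_index([1.0, 0.0, 0.0])
```

With k = 0.78 a single inch of rain decays to about 0.04 in. of index after thirteen dry days, so the chosen constant gives little weight to rain that fell early in the 14-day window.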
19.         Average water content (%) of soil for all
            stations for 3-, 9-, and 18-inch depths, at
            beginning of first hyetograph.

20.         Predominant wind direction during storm period
            (dimensionless), obtained from weather station
            data, and given the numbers 1, 2, 3, ..., 8 to
            represent a wind from the N, NW, W, SW, S, SE,
            E, NE, respectively.

21.         Week of the year (dimensionless) in which the
            date of the peak discharge rate fell. Weeks
            were numbered from the first week in Jan.,
            regardless of the number of days in this week.

22. AIRT :  Average Thiessen-weighted air temperature
            (deg F), 4 ft above ground, during storm period.
            Given by,

            AIRT = \frac{\sum_i (T_i A_i)}{\sum_i A_i}

            where,

            T_i = mean hourly temperature at a station
                  during the time from beginning
                  of first precipitation to end of
                  runoff hydrograph,

            and,

            A_i = Thiessen area of each weather station
                  on the watershed.


23. ATSD :  Average Thiessen-weighted standard deviation
            of air temperatures (deg F), 4 ft above ground,
            during the storm period. Given by,

            ST_i^2 = \frac{1}{n-1} \left[ \sum_j T_j^2 - \frac{(\sum_j T_j)^2}{n} \right]

            and,

            ATSD = \frac{\sum_i (ST_i A_i)}{\sum_i A_i}

            where,

            ST_i = standard deviation of hourly air
                   temperatures at a weather station
                   during the storm period,

            n = number of hourly air temperatures
                considered,

            T_j = hourly air temperature at 4 ft
                  above ground at a station,

            and,

            A_i = Thiessen area of each weather station
                  on the watershed.

24. WVEL :  Mean wind velocity (mph) during the storm
            period.

25. WSD  :  Standard deviation (mph) of hourly wind
            velocities during the storm period.

26. SOLT :  Average Thiessen-weighted soil temperature
            (deg F), 3 in. below surface, during storm
            period. Computed exactly as air temperature
            mean, only using four readings per day at
            6-hr intervals, beginning at 3:00 a.m.

27. STSD :  Average Thiessen-weighted standard deviation
            of soil temperatures (deg F), 3 in. below surface,
            during storm period. Computed in the
            same manner as the air temperature standard
            deviation.

28. DEGD :  Average Thiessen-weighted Degree-Days (deg F)
            for a 14-day period prior to the beginning of
            runoff. Degree-Days for each day was maximum
            daily temperature, and 14-day average was,

            DEGD =

            where,

            total Degree-Days for a weather
            station for a 14-day period before
            the first day of runoff,

            and,

            A_i = Thiessen area of each weather
                  station on the watershed.


29. SWEQ :  Snow-water equivalent (in.), or the volume of
            water in the snowpack on the watershed on the
            date of the beginning of runoff. Computed
            from Thiessen-weighted volume from each snow
            course. Loss or gain in volume from date of
            snow survey to date of runoff was computed as
            the difference between accumulated precipitation
            and accumulated runoff. This was added to the
            Thiessen-weighted volume on the watershed at
            the time of the last snow survey, giving the
            volume on the date of beginning of runoff.
            Thiessen areas of snow courses were used to
            give the volume on the date of the survey, or,


V  = 


where , 


snow-water  equivalent  at  a  snow 
course , 


and, 


Thiessen  area  of  each  snow  course 
on  the  watershed. 
APPENDIX D

C
C     THIS PROGRAM COMPUTES VALUES FOR VARIABLES 13, 14,
C     15, 16, AND 17 USING RECORDING AND NON-RECORDING PRECIP
C     DATA IN USWB FORM.
C     INPUT FOR A WATERSHED CONSISTS OF
C     CARD 1   NO. OF RECORDING PRECIP STAS., NO. OF RUNOFF
C              EVENTS, NO. OF NON-RECORDING PRECIP STAS., AREA
C              OF WSHD.
C     CARD 2   THIESSEN AREAS OF RECORDING PRECIP STAS.
C     CARD 3   THIESSEN AREAS OF ALL PRECIP STAS.-SUM TO AREA.
C     CARD 4   LAST FOUR DIGITS OF STA NOS. OF REC. AND NON-
C              REC. PRECIP STAS.
C     CARD 5   DATE OF PEAK DISCHARGE, DATE FIRST RECORDING
C              PRECIP, HOUR FIRST REC. PRECIP, DATE FIRST REC.
C              PRECIP EACH STATION, HOUR FIRST REC. PRECIP
C              EACH STA., DATE FIRST NON-REC. PRECIP EACH STA,
C              DATE END OF RUNOFF, AND HOUR END OF RUNOFF, ALL
C              FOR FIRST EVENT.
C     CARD 6   SAME DATA ON CARD 5 FOR SECOND EVENT, ETC.
C     CARD 7   RECORDING PRECIP DATA IN ORDER OF STAS.-CARD 2.
C     CARD 8   NON-REC. PRECIP DATA IN ORDER OF STAS.-CARD 2.
C     CARD 9   BLANK CARD TERMINATES PROGRAM.


    1 FORMAT (1X,4A4,5X,I2,5X,I2,5X,I2,5X,F8.2)
    2 FORMAT (1X,7F8.2)
    3 FORMAT (1X,I4)
    4 FORMAT (1X,3I2,5X,3I2,5X,I2,5X,3I2,5X,I2,5X,3I2,5X,3I2,
     15X,I2)
    5 FORMAT (2X,I4,3I2,1X,12A3)
    60FORMAT (1X,35H CARDS OUT OF ORDER FOR STATION NO.,I5,
     113H BEGINNING ON,2(I2,1H/),I2/19H PROGRAM TERMINATED)
    7 FORMAT (2X,I4,3I2,10X,A4)
  200 FORMAT (1H1)
    80FORMAT (10X,37H VARIABLES 13, 14, 15, 18, AND 19 FOR,
     14A4,10H WATERSHED//15X,5H PEAK,5X,5H MEAN,5X,
     212H VARIANCE OF,5X,11H STD DEV OF,5X,9H THIESSEN,5X,
     39H THIESSEN,5X,13H THIESSEN AVG/15X,5H DATE,3X,
     410H INTENSITY,2X,12H INTENSITIES,4X,12H INTENSITIES,
     55X,9H DURATION,4X,11H TIME DISTR,5X,11H TOT PRECIP)
    90FORMAT (25X,5H (IN),7X,8H (SQ IN),10X,5H (IN),8X,
     18H (HOURS),6X,8H (HOURS),10X,5H (IN)//)
   100FORMAT (13X,2(I2,1H/),I2,5X,
     132H MISSING DATA FOR STATION NO 240,I3,3H ON,1X,
     22(I2,1H/),I2,2X,27H COMPUTE VARIABLES BY HAND.)
   110FORMAT (13X,2(I2,1H/),I2,5X,
     138H ACCUMULATED PRECIP FOR STATION NO 240,I3,3H ON,1X,
     22(I2,1H/),I2,2X,27H COMPUTE VARIABLES BY HAND.)
   120FORMAT (13X,2(I2,1H/),I2,4X,F6.4,7X,F7.5,10X,F7.5,8X,
     1F6.1,5X,F6.1,9X,F8.4)
   13 FORMAT (2X,I4)
     0DIMENSION THA(4),THAN(7),MSTANO(7),NWSHD(4),NYRP(15),
     1NMOP(15),NDAP(15),NYRB(15),NMOB(15),NDAB(15),NHR(15),
     2NYRBB(15,7),NMOBB(15,7),NDABB(15,7),NHRR(15,7),
     3NYRBN(15,7),NMOBN(15,7),NDABN(15,7),NYRE(15),NMOE(15),
     4NDAE(15),NHRE(15),NNDAB(15),NNDABN(15,7),NNDAE(15),
     5XN(15),XSUM(15),XSUMSQ(15),SUMDA(15),SUMA(15)
     0DIMENSION SUMAP(15),SSUMXT(15,7),SUMXT(15,7),SUMPTA(15),
     1SUMX(15,7),SSUMX(15,7),L(24),KKK(15),SUMXSQ(15),VAR(15),
     2STDEV(15),XBAR(15),DCA(15,7),D(15),PCA(15,7),TL(15),
     3PT(15)
'     I  I K  0 
   14 CONTINUE
      READ 1,(NWSHD(J),J=1,4),II,NS,JJ,A
C     READ STORM CONTROL CARDS FOR A WATERSHED AND CONVERT
C     DATES TO DAY OF THE YEAR.
      IF(A)159,159,15
   15 READ 2,(THA(J),J=1,II)
      LL=II+JJ
      READ 2,(THAN(J),J=1,LL)
      DO 68 KK=1,LL
   68 READ 3,MSTANO(KK)
      DO 168 K=1,NS
      DO 67 KK=1,LL
     0READ 4,NYRP(K),NMOP(K),NDAP(K),NYRB(K),NMOB(K),NDAB(K),
     1NHR(K),NYRBB(K,KK),NMOBB(K,KK),NDABB(K,KK),NHRR(K,KK),
     2NYRBN(K,KK),NMOBN(K,KK),NDABN(K,KK),NYRE(K),NMOE(K),
     3NDAE(K),NHRE(K)
      NAA=NMOB(K)
      IF(NAA)172,172,173
  173 GO TO (16,17,18,19,20,21,22,23,24,25,26,27),NAA

   16 NNDAB(K)=NDAB(K)
      GO TO 28
  172 NNDAB(K)=0
      GO TO 28
   17 NNDAB(K)=NDAB(K)+31
      GO TO 28
   18 NNDAB(K)=NDAB(K)+59
      GO TO 28
   19 NNDAB(K)=NDAB(K)+90
      GO TO 28
   20 NNDAB(K)=NDAB(K)+120
      GO TO 28
   21 NNDAB(K)=NDAB(K)+151
      GO TO 28
   22 NNDAB(K)=NDAB(K)+181
      GO TO 28
   23 NNDAB(K)=NDAB(K)+212
      GO TO 28
   24 NNDAB(K)=NDAB(K)+243
      GO TO 28
   25 NNDAB(K)=NDAB(K)+273
      GO TO 28
   26 NNDAB(K)=NDAB(K)+304
      GO TO 28
   27 NNDAB(K)=NDAB(K)+334
   28 NBB=NYRB(K)-62
      IF(NBB)32,32,174
  174 GO TO (32,29,32,32,32,29),NBB
   29 IF(NNDAB(K)-60)32,30,30
   30 IF(NMOB(K)-2)32,32,31
   31 NNDAB(K)=NNDAB(K)+1
   32 NCC=NMOBN(K,KK)
      IF(NCC)170,170,171
  171 GO TO (33,34,35,36,37,38,39,40,41,42,43,44),NCC

   33 NNDABN(K,KK)=NDABN(K,KK)
      GO TO 45
  170 NNDABN(K,KK)=0
      GO TO 45
   34 NNDABN(K,KK)=NDABN(K,KK)+31
      GO TO 45
   35 NNDABN(K,KK)=NDABN(K,KK)+59
      GO TO 45
   36 NNDABN(K,KK)=NDABN(K,KK)+90
      GO TO 45
   37 NNDABN(K,KK)=NDABN(K,KK)+120
      GO TO 45
   38 NNDABN(K,KK)=NDABN(K,KK)+151
      GO TO 45
   39 NNDABN(K,KK)=NDABN(K,KK)+181
      GO TO 45
   40 NNDABN(K,KK)=NDABN(K,KK)+212
      GO TO 45
   41 NNDABN(K,KK)=NDABN(K,KK)+243
      GO TO 45
   42 NNDABN(K,KK)=NDABN(K,KK)+273
      GO TO 45
   43 NNDABN(K,KK)=NDABN(K,KK)+304
      GO TO 45
   44 NNDABN(K,KK)=NDABN(K,KK)+334
   45 NDD=NYRBN(K,KK)-62
      IF(NDD)49,49,175
  175 GO TO (49,46,49,49,49,46),NDD
   46 IF(NNDABN(K,KK)-60)49,47,47
   47 IF(NMOBN(K,KK)-2)49,49,48
   48 NNDABN(K,KK)=NNDABN(K,KK)+1
   49 NEE=NMOE(K)
      IF(NEE)176,176,177
  177 GO TO (50,51,52,53,54,55,56,57,58,59,60,61),NEE
   50 NNDAE(K)=NDAE(K)
      GO TO 62
  176 NNDAE(K)=0
      GO TO 62
   51 NNDAE(K)=NDAE(K)+31
      GO TO 62
   52 NNDAE(K)=NDAE(K)+59
      GO TO 62
   53 NNDAE(K)=NDAE(K)+90
      GO TO 62
   54 NNDAE(K)=NDAE(K)+120
      GO TO 62
   55 NNDAE(K)=NDAE(K)+151
      GO TO 62
   56 NNDAE(K)=NDAE(K)+181
      GO TO 62
   57 NNDAE(K)=NDAE(K)+212
      GO TO 62
   58 NNDAE(K)=NDAE(K)+243
      GO TO 62
   59 NNDAE(K)=NDAE(K)+273
      GO TO 62
   60 NNDAE(K)=NDAE(K)+304
      GO TO 62
   61 NNDAE(K)=NDAE(K)+334
   62 NGG=NYRE(K)-62
      IF(NGG)66,66,178
  178 GO TO (66,63,66,66,66,63),NGG
   63 IF(NNDAE(K)-60)66,64,64
   64 IF(NMOE(K)-2)66,66,65
   65 NNDAE(K)=NNDAE(K)+1
   66 XN(K)=0.
      XSUM(K)=0.
      XSUMSQ(K)=0.
      SUMDA(K)=0.
      SUMA(K)=0.
      SUMAP(K)=0.
      KKK(K)=0
      SSUMXT(K,KK)=0.
      SUMXT(K,KK)=0.
      SUMPTA(K)=0.
      SUMX(K,KK)=0.
   67 SSUMX(K,KK)=0.
  168 CONTINUE
      K=1
      I=1
   69 XHR=0.
      NC=0
      ZHR=0.
      NDAY=0
C     READ AND STORE PERTINENT RECORDING PRECIP DATA.
   70 NDDAY=NDAY
      READ 5,NSTANO,NYR,NMON,NDATE,(L(J),J=1,12)
      READ 5,JSTANO,KYR,KMON,KDATE,(L(J),J=13,24)
      IF(KMON-NMON)73,71,73
   71 IF(KDATE-NDATE)73,72,73
   72 IF(KYR-NYR)73,74,73
   73 PRINT 6,NSTANO,NMON,NDATE,NYR
      GO TO 159
   74 NR=0
      IF(NSTANO-MSTANO(I))70,75,70
   75 GO TO (76,77,78,79,80,81,82,83,84,85,86,87),NMON
   76 NDAY=NDATE
      GO TO 88
   77 NDAY=NDATE+31
      GO TO 88
   78 NDAY=NDATE+59
      GO TO 88
   79 NDAY=NDATE+90
      GO TO 88
   80 NDAY=NDATE+120
      GO TO 88
   81 NDAY=NDATE+151
      GO TO 88
   82 NDAY=NDATE+181
      GO TO 88
   83 NDAY=NDATE+212
      GO TO 88
   84 NDAY=NDATE+243
      GO TO 88
   85 NDAY=NDATE+273
      GO TO 88
   86 NDAY=NDATE+304
      GO TO 88
   87 NDAY=NDATE+334
   88 NNYR=NYR-62
      GO TO (92,89,92,92,92,89),NNYR
   89 IF(NDAY-60)92,90,90
   90 IF(NMON-2)92,92,91
   91 NDAY=NDAY+1
   92 IF(NR)135,93,135

   93 IF(NDAY-NNDAB(K))94,160,94
  160 IF(NYR-NYRB(K))70,103,70
   94 IF(NC-1)95,101,95
   95 IF(NDAY-NNDAB(K))70,70,161
  161 IF(NYR-NYRB(K))70,96,70
   96 IF(NDAY-NNDAE(K))98,97,98
   97 XNHRE=NHRE(K)
      SUB=XNHRE-1.
      GO TO 100
   98 IF(NDAY-NNDAE(K))99,99,107
   99 SUB=0.0
  100 XDAYS=NDAY-NNDAB(K)
      NC=1
      ZHR=ZHR+24.*XDAYS-SUB
      GO TO 103
  101 IF(NDAY-(NDDAY+1))102,103,102
  102 XNDAY=NDAY-NDDAY-1
      ZHR=ZHR+24.*XNDAY
      XHR=XHR+24.*XNDAY
  103 NC=1
      IF(NDAY-NNDAE(K))105,104,105
  104 JK=NHRE(K)
      GO TO 114
  105 IF(NDAY-NNDAE(K))106,106,107
  106 JK=24
      GO TO 114
  107 IF(K-NS)108,111,108
  108 IF(NDAY-NNDAB(K+1))109,110,109
  109 IF(NDATE-NDABB(K+1,I))128,110,128
  110 K=K+1
      ZHR=0.0
      XHR=0.0
      GO TO 103
  111 IF(I-II)112,128,112
  112 IF(NDATE-NDABB(1,I+1))128,113,128
  113 K=1
      I=I+1
      GO TO 103
  114 DO 127 J=1,JK
      IF(NDAY-NNDAB(K))115,118,115
  115 IF(NDATE-NDABB(K,I))120,116,120
  116 IF(J-NHRR(K,I))117,120,120
  117 ZHR=ZHR+1.0
      GO TO 127
  118 IF(J-NHR(K))127,119,119
  119 IF(J-NHRR(K,I))121,120,120
  120 XHR=XHR+1.0
  121 ZHR=ZHR+1.0
      K1=L(J)/10000
      NKK=K1*10000
      J1=(L(J)-NKK)/100
      J3=K1/100
      KKKK=J3*100
      J2=K1-KKKK
      IF(J1)122,122,126
  122 IF(J2)124,124,123
  123 ZZ=1.
      KKK(K)=K
      NOO=NDATE
      NOOO=NYR
      NOON=NMON
      NONN=NSTANO
      GO TO 128
  124 IF(J3)125,123,127
  125 ZZ=0.
      KKK(K)=K
      NOO=NDATE
      NOOO=NYR
      NOON=NMON
      NONN=NSTANO
      GO TO 128
C     COMPUTE SUMS AND SUMS OF SQUARES FOR RECORDING DATA.
  126 N=(J1-70)+(J2-70)*10+(J3-70)*100
      X=N
      XXN=X/100.
      SUMXT(K,I)=SUMXT(K,I)+XXN*XHR
      SSUMXT(K,I)=SSUMXT(K,I)+XXN*ZHR
      SSUMX(K,I)=SSUMX(K,I)+XXN
      XN(K)=XN(K)+1.0
      XSUM(K)=XSUM(K)+XXN
      XSUMSQ(K)=XSUMSQ(K)+XXN**2
      SUMX(K,I)=SUMX(K,I)+XXN
  127 CONTINUE
      GO TO 70
  128 IF(K-NS)129,130,129
  129 K=K+1
      GO TO 131
  130 I=I+1
      K=1
  131 IF(I-II-1)69,132,132
  132 K=1
      I=1
C     READ AND STORE ALL PERTINENT NON-RECORDING DATA FOR ALL
C     STORMS.
  133 READ 7,NSTANO,NYR,NMON,NDATE,NDSUM
      MM=II+I
      IF(NSTANO-MSTANO(MM))133,134,133
  134 KK=II+I
      NR=1
      GO TO 75
  135 K1=NDSUM/100
      NKK=K1*100
      J1=NDSUM-NKK
      K2=K1/100
      NKK1=K2*100
      J2=K1-NKK1
      J4=K2/100
      NKK2=J4*100
      J3=K2-NKK2
      IF(J1-20)138,136,137
  136 XXN=0.0
      GO TO 140
  137 N=(J1-70)+(J2-70)*10+(J3-70)*100+(J4-70)*1000
      X=N
      XXN=X/100.
      GO TO 140
  139 KKK(K)=K
      ZZ=0.0
      NOO=NDATE
      NOOO=NYR
      NOON=NMON
      NONN=NSTANO
      GO TO 144
  138 IF(J4-20)140,136,137
  140 IF(NDAY-NNDABN(K,KK))133,162,162
  162 IF(NYR-NYRBN(K,KK))133,141,133
  141 IF(NDAY-NNDAE(K))142,143,143
  142 IF(J1-20)899,900,900
  899 IF(J4-20)139,900,900
  900 SUMX(K,KK)=SUMX(K,KK)+XXN
      GO TO 133
  143 IF(NDAY-NNDAE(K))144,142,144
  144 IF(K-NS)145,146,145
  145 K=K+1
      GO TO 147
  146 K=1
      I=I+1
C     COMPUTE VALUES OF VARIABLES FOR EACH STORM.
  147 IF(I-JJ-1)133,148,148
  148 DO 150 K=1,NS
      DO 149 I=1,II
      SUMXSQ(K)=XSUM(K)**2
      IF(XN(K))149,149,404
  404 VAR(K)=(XSUMSQ(K)-SUMXSQ(K)/XN(K))/(XN(K)-1.0)
      STDEV(K)=SQRTF(VAR(K))
      XBAR(K)=XSUM(K)/XN(K)
      IF(SSUMX(K,I))149,149,405
  405 DCA(K,I)=SUMXT(K,I)/SSUMX(K,I)
      SUMDA(K)=SUMDA(K)+DCA(K,I)*THA(I)
      D(K)=SUMDA(K)/A
      PCA(K,I)=SSUMXT(K,I)/SSUMX(K,I)
      SUMPTA(K)=SUMPTA(K)+PCA(K,I)*THA(I)
      TL(K)=SUMPTA(K)/A
  149 CONTINUE
  150 CONTINUE
      PRINT 200
C     OUTPUT TABLE HEADINGS AND VALUES OF VARIABLES FOR ALL
C     STORMS.
      PRINT 8,(NWSHD(J),J=1,4)
      PRINT 9
      IJI=II+JJ
      DO 152 K=1,NS
      DO 151 I=1,IJI
  151 SUMAP(K)=SUMAP(K)+SUMX(K,I)*THAN(I)
  152 CONTINUE
      DO 157 K=1,NS
      IF(K-KKK(K))156,153,156
  153 IF(ZZ)155,154,155
  154 PRINT 10,NMOP(K),NDAP(K),NYRP(K),NONN,NOON,NOO,NOOO
      GO TO 157
  155 PRINT 11,NMOP(K),NDAP(K),NYRP(K),NONN,NOON,NOO,NOOO
      GO TO 157
  156 PT(K)=SUMAP(K)/A
     0PRINT 12,NMOP(K),NDAP(K),NYRP(K),XBAR(K),VAR(K),
     1STDEV(K),D(K),TL(K),PT(K)
  157 CONTINUE
  158 READ 13,NSTANO
      IF(NSTANO)158,14,158
  159 CALL EXIT
      END
C     APPENDIX E
C
C     CORRELATION COMPUTER PROGRAM
C     FROM COOLEY AND LOHNES
C     INPUT IS ACTUAL RAW OR TRANSFORMED DATA TO BE READ IN BY
C     ROWS FROM A MATRIX WITH DIFFERENT TRIALS AS ROWS AND VAR-
C     IABLES AS COLUMNS.  ALL VALUES FOR EACH TRIAL (ONE ROW OF
C     THE DATA MATRIX) ARE READ AND TREATED BEFORE THE NEXT ROW
C     IS READ.  THIS ELIMINATES STORAGE OF THE WHOLE DATA MAT-
C     RIX.  INPUT AND OUTPUT FORMAT STATEMENTS MUST BE CHANGED
C     TO ACCOMMODATE THE DATA.  THE FOLLOWING VARIABLE NAMES ARE
C     USED IN THE PROGRAM.  THE FIRST FOUR MUST BE READ IN ON
C     A CARD IMMEDIATELY PRECEDING THE DATA AND COMPLYING
C     WITH THE FIRST FORMAT.  THE REST DEFINE THE OUTPUT.
C     M=THE NUMBER OF VARIABLES TO BE CORRELATED WITH
C       EACH OTHER.
C     N=THE NUMBER OF OBSERVATIONS OF ALL M VARIABLES.
C     LPUNCH=0 IF CORRELATION MATRIX IS TO BE PUNCHED.
C           =1 IF CORRELATION IS TO BE PRINTED ONLY.
C     LPRINT=0 IF ALL COMPUTED PARAMETERS AND MATRICES ARE TO
C              BE PRINTED AND PUNCHED.
C           =1 IF ONLY CORRELATION MATRIX IS TO BE PRINTED.
C     SX(I)= THE SUM OF ALL VALUES OF A VARIABLE WHERE I IS
C            TAKEN FROM 1 TO M.
C     X(I)= A VALUE OF A VARIABLE WHERE I IS THE SUBSCRIPT
C           OF THE PARTICULAR VARIABLE.
C     SS(I,J)= A MATRIX ENTRY WHICH IS EITHER A SUM OF THE
C              SQUARES OF ALL VALUES OF A VARIABLE OR A SUM OF
C              THE CROSS-PRODUCTS OF ALL VALUES OF TWO VARIAB-
C              LES.
C     SSD(I,J)=A MATRIX ENTRY IN THE DEVIATION SUMS OF SQUARES
C              MATRIX, COMPUTED FROM THE VALUES IN THE
C              PREVIOUS MATRIX.
C     D(I,J)= A MATRIX ENTRY IN THE VARIANCE-COVARIANCE MATRIX
C             COMPUTED AS SSD(I,J)/(N-1).
C     SD(I)= A STANDARD DEVIATION COMPUTED AS THE SQUARE ROOT
C            OF THE VARIANCES, D(I,I), I=1,M
C     R(I,J)= A CORRELATION COEFFICIENT IN THE CORRELATION
C             MATRIX.
C     OUTPUT FROM THE PROGRAM IS LABELLED THROUGH OUTPUT FORMAT
C     STATEMENTS.
     0DIMENSION X(32),SX(32),SS(32,32),SSD(32,32),D(32,32),
     1R(32,32),SD(32),XM(32)
      COMMON X,M
      READ 1,M,N,LPUNCH,LPRINT
    1 FORMAT (I2,I2,I1,I1)
      NTRIAL=N
C     COMPUTE SUMS, SUMS OF SQUARES, SUMS OF CROSS-PRODUCTS
      DO 2 I=1,M
      SX(I)=0.0
      DO 2 J=I,M
    2 SS(I,J)=0.0
    3 READ 4,(X(I),I=1,M)
    4 FORMAT ((10X,10F7.0))
      CALL TRANFO
      DO 5 I=1,M
      SX(I)=SX(I)+X(I)
      DO 5 J=I,M
    5 SS(I,J)=SS(I,J)+X(I)*X(J)
      NTRIAL=NTRIAL-1
      XN=N
C     SET LOWER TRIANGLE EQUAL TO UPPER TRIANGLE AND COMPUTE THE
C     DEVIATION SUMS OF SQUARES AND CROSS-PRODUCTS MATRIX
      IF (NTRIAL) 6,6,3
    6 DO 7 I=1,M
      DO 7 J=I,M
      SSD(I,J)=(XN*SS(I,J)-SX(I)*SX(J))/XN
      SS(J,I)=SS(I,J)
    7 SSD(J,I)=SSD(I,J)
C     COMPUTE STANDARD DEVIATIONS
      DO 8 I=1,M
      XM(I)=SX(I)/XN
    8 SD(I)=SQRTF(SSD(I,I)/(XN-1.0))
      PRINT 9
      PUNCH 9
    9 FORMAT (1H1,30X,27H CORRELATION PROGRAM OUTPUT/)
C     OPTION TO PRINT NUMBER OF VARIABLES, NUMBER OF OBSERVA-
C     TIONS, MEANS, STANDARD DEVIATIONS, SUMS OF SQUARES,
C     DEVIATION SUMS OF SQUARES
      IF (LPRINT) 10,10,22
   10 PRINT 11,M,N
      PUNCH 11,M,N
   110FORMAT (25X,4H FOR,I3,15H VARIABLES WITH,I3,
     113H OBSERVATIONS//)
      PRINT 12
      PUNCH 12
   12 FORMAT (25X,37H MEANS IN THE ORDER OF VARIABLE INPUT/)
      PRINT 13,(XM(I),I=1,M)
      PUNCH 13,(XM(I),I=1,M)
   13 FORMAT ((14X,6F10.6)/)
      PRINT 14
      PUNCH 14
   14 FORMAT (27X,33H STANDARD DEVIATIONS OF VARIABLES/)
      PRINT 15,(SD(I),I=1,M)
      PUNCH 15,(SD(I),I=1,M)
   15 FORMAT ((14X,6F10.6)/)
      PRINT 16
   160FORMAT (23X,35H SUMS OF SQUARES AND CROSS PRODUCTS,
     17H MATRIX/)
   17 FORMAT (14X,4H ROW,I3/(14X,6F10.4)/)
      DO 18 I=1,M
   18 PRINT 17,I,(SS(I,J),J=1,M)
      PRINT 19
   190FORMAT (18X,36H DEVIATION SUMS OF SQUARES AND CROSS,
     116H PRODUCTS MATRIX/)
   20 FORMAT (14X,4H ROW,I3/(14X,6F10.4)/)
      DO 21 I=1,M
   21 PRINT 20,I,(SSD(I,J),J=1,M)
C     COMPUTE VARIANCE-COVARIANCE MATRIX
   22 DO 23 I=1,M
      DO 23 J=I,M
      D(I,J)=SSD(I,J)/(XN-1.0)
   23 D(J,I)=D(I,J)
C     OPTION TO PRINT VARIANCE-COVARIANCE MATRIX
      IF (LPRINT) 24,24,28
   24 PRINT 25
      PUNCH 25
   25 FORMAT (30X,27H VARIANCE-COVARIANCE MATRIX/)
   26 FORMAT (14X,4H ROW,I3/(14X,6F10.5)/)
      DO 27 I=1,M
      PUNCH 26,I,(D(I,J),J=1,M)
   27 PRINT 26,I,(D(I,J),J=1,M)
C     COMPUTE CORRELATION MATRIX
   28 DO 29 I=1,M
      DO 29 J=I,M
      R(I,J)=D(I,J)/(SD(I)*SD(J))
   29 R(J,I)=R(I,J)
C     OPTION TO PUNCH CORRELATION MATRIX
      PRINT 30
      PUNCH 30
   30 FORMAT (34X,19H CORRELATION MATRIX/)
   31 FORMAT (14X,4H ROW,I3/(14X,6F10.6)/)
      DO 32 I=1,M
   32 PRINT 31,I,(R(I,J),J=1,M)
      IF (LPUNCH) 33,33,35
   33 DO 34 I=1,M
   34 PUNCH 31,I,(R(I,J),J=1,M)
   35 CALL EXIT
      END
      SUBROUTINE TRANFO
      DIMENSION X(32)
      COMMON X,M
      DO 5 I=1,M
      IF(I-28)2,1,2
    1 X(I)=X(I)+32.0
    2 IF(X(I))4,3,4
    3 X(I)=.000001
    4 X(I)=ALOG10(X(I))
    5 CONTINUE
      RETURN
      END
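The correlation program and its TRANFO subroutine can be summarized in a short Python sketch: each data row is log-transformed as it is read, running sums and cross-products are accumulated, and the correlation matrix is formed at the end. The function names, default arguments, and sample data are hypothetical. The sketch follows the listing's logic but is not a line-by-line translation.

```python
import math

def log_transform(row, offset_index=27, offset=32.0, floor=1e-6):
    """Mimic TRANFO: shift variable 28, replace zeros, take log10."""
    out = []
    for i, x in enumerate(row):
        if i == offset_index:   # variable 28 in the listing's 1-based count
            x += offset
        if x == 0.0:            # log10(0) is undefined, so use a small floor
            x = floor
        out.append(math.log10(x))
    return out

def correlation_matrix(rows):
    """Correlations from running sums and sums of cross-products."""
    n, m = len(rows), len(rows[0])
    sx = [0.0] * m
    ss = [[0.0] * m for _ in range(m)]
    for x in rows:                      # one pass, one row at a time
        for i in range(m):
            sx[i] += x[i]
            for j in range(i, m):
                ss[i][j] += x[i] * x[j]
    ssd = [[0.0] * m for _ in range(m)]  # deviation sums of squares
    for i in range(m):
        for j in range(i, m):
            ssd[i][j] = ssd[j][i] = (n * ss[i][j] - sx[i] * sx[j]) / n
    sd = [math.sqrt(ssd[i][i] / (n - 1)) for i in range(m)]
    return [[(ssd[i][j] / (n - 1)) / (sd[i] * sd[j]) for j in range(m)]
            for i in range(m)]

# Hypothetical data: two variables that are inversely related.
raw = [(1.0, 100.0), (10.0, 10.0), (100.0, 1.0)]
rows = [log_transform(r) for r in raw]
r = correlation_matrix(rows)
```

Accumulating sums row by row is what lets the original program avoid storing the whole data matrix. A variable with zero variance would cause a division by zero here, as it would in the listing.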
C     APPENDIX F
C
C     PRINCIPAL COMPONENT PROGRAM
C     FROM COOLEY AND LOHNES
C     THIS PROGRAM PERFORMS PRINCIPAL-COMPONENT ANALYSIS
C     THAT BEGINS WITH EITHER THE CORRELATION OR DISPERSION
C     MATRIX, RATHER THAN RAW SCORES.  MATRICES UP TO 60TH
C     ORDER ARE POSSIBLE, BUT THE DIMENSION STATEMENT COULD
C     EASILY BE MODIFIED IF LARGER MATRICES WERE NECESSARY.
C
C     INPUT.
C     CONTROL CARD 1 CONTAINS (COL 1-48) = PROBLEM IDEN-
C     TIFICATION WHICH MAY BE ALPHABETIC, (COL 49-50)=SIZE OF
C     MATRIX (RIGHT-JUSTIFIED).  IF M=0, THEN CALL EXIT.  THE
C     MATRIX FOLLOWS CARD 1 AND IS READ IN BY A SUBROUTINE
C     WHICH MUST BE STORED TO THE RIGHT OF THE MAIN DIAGONAL
C     OF R.  SUBROUTINE HDIAG IS REQUIRED.
C
C     OUTPUT.
C     PRINTED OUTPUT INCLUDES THE ROOTS OF MATRIX, NORM-
C     ALIZED VECTORS, PERCENTAGE OF TRACE ACCOUNTED FOR BY THE
C     ROOTS, AND THE FACTOR LOADINGS.
      DIMENSION PROB(12)
      DIMENSION R(60,60),SS(60,60),IQ(60),FRACT(60),X(60)
      INTEGER PROB
C     READING IN THE CONTROL CARD
  100 READ (105,101) PROB,M
  101 FORMAT (12A4,I2)
      IF(M) 1000,1000,210
  210 WRITE (108,21) PROB
      WRITE (106,21) PROB
   21 FORMAT (1H1,24X,12A4///)
      WRITE (108,25)
      WRITE (106,25)
   25 FORMAT (30X,28H ORIGINAL CORRELATION MATRIX/)
      CALL RHDIAG(R,M)
      T = M
      CALL HDIAG (R,M,0,SS,NR)
      WRITE (108,710) NR
      WRITE (106,710) NR
  7100FORMAT (1H0,22X,31H NO. OF ROTATIONS FROM JACOBIAN,
     113H SUBROUTINE =,I3//)
      WRITE (108,1)
      WRITE (106,1)
    10FORMAT (22X,36H CHARACTERISTIC ROOTS OF CORRELATION,
     17H MATRIX/)
      WRITE (108,3) (R(I,I), I=1,M)
      WRITE (106,3) (R(I,I), I=1,M)
3     FORMAT ((14X,6F10.6))
      WRITE (108,4)
      WRITE (106,4)
4    0FORMAT (1H0,32X,24H NORMALIZED EIGENVECTORS/25X,
     138H NOTE THAT VECTORS ARE WRITTEN IN ROWS/)
5     FORMAT ((14X,6F10.4))
      DO 6 J = 1,M
      WRITE (106,7) J,(SS(I,J), I = 1,M)
6     WRITE (108,7) J,(SS(I,J), I = 1,M)
7     FORMAT (14X,7H VECTOR,I3/(14X,6F10.6))
      DO 10 I = 1,M
10    FRACT(I) = (R(I,I) / T)*100.0
      WRITE (108,11)
      WRITE (106,11)
11   0FORMAT (1H0,15X,37H PERCENTAGE OF VARIANCE ACCOUNTED FOR,
     120H BY EACH EIGENVECTOR/)
      WRITE (108,5) (FRACT(I), I=1,M)
      WRITE (106,5) (FRACT(I), I=1,M)
      DO 111 I =2,M
111   FRACT(I) = FRACT(I) + FRACT(I-1)
      WRITE (108,112)
      WRITE (106,112)
112   FORMAT (1H0,28X,23H ACCUMULATED PERCENTAGE/)
      WRITE (108,5) (FRACT(I), I = 1,M)
      WRITE (106,5) (FRACT(I), I = 1,M)
      WRITE (108,113)
      WRITE (106,113)
113  0FORMAT (1H0,37X,14H FACTOR MATRIX/25X,10H NOTE THAT,
     128H FACTORS ARE WRITTEN IN ROWS/)
      DO 12 I = 1,M
12    R(I,I) = SQRT(R(I,I))
      DO 13 J = 1,M
      DO 13 I = 1,M
13    SS(I,J) = SS(I,J) * R(J,J)
      DO 14 J = 1,M
      WRITE (106,15) J, (SS(I,J), I =1,M)
14    WRITE (108,15) J, (SS(I,J), I =1,M)
15    FORMAT (14X,7H FACTOR,I3/(14X,6F10.6))
      GO TO 100
1000  CALL EXIT
      END
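The bookkeeping that the main program performs after HDIAG returns can be sketched in Python as follows. This is a hedged illustration rather than a translation of the listing: the function name `summarize_components` and its argument layout (one eigenvector per row, matching the "written in rows" output) are assumptions. It reproduces three steps from the listing: each root is expressed as a percentage of the trace T = M, the percentages are accumulated, and each eigenvector is scaled by the square root of its root to give the factor loadings.

```python
import math


def summarize_components(roots, vectors):
    """Percentages of trace, accumulated percentages, and factor loadings.

    roots   : eigenvalues of an M x M correlation matrix
    vectors : list of normalized eigenvectors, one per root (rows)
    Both argument names are hypothetical, chosen for this sketch.
    """
    m = len(roots)                      # T = M: trace of a correlation matrix
    percent = [100.0 * r / m for r in roots]            # statement 10
    accumulated = []
    running = 0.0
    for p in percent:                                   # statement 111
        running += p
        accumulated.append(running)
    loadings = [[math.sqrt(r) * v for v in vec]         # statements 12-13
                for r, vec in zip(roots, vectors)]
    return percent, accumulated, loadings
```

For a 2 x 2 correlation matrix with off-diagonal element 0.5 the roots are 1.5 and 0.5, so the first component accounts for 75 percent of the trace. As in the Fortran, a negative root would make the square root fail.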
      SUBROUTINE RHDIAG (R,M)
      DIMENSION R(60,60)
1     FORMAT (14X,4H ROW,I3/(14X,6F10.6))
2     FORMAT ((14X,6F10.6))
      DO 3 I=1,M
      READ 2, (R(I,J),J=I,M)
3     PRINT 1, I, (R(I,J),J=I,M)
      RETURN
      END

C  SUBROUTINE HDIAG.
C
C  PROGRAMMED BY F. J. CORBATO AND M. MERWIN OF THE M. I. T.
C  COMPUTATION CENTER.
C  THIS SUBROUTINE COMPUTES THE EIGENVALUES AND EIGEN-
C  VECTORS OF A REAL SYMMETRIC MATRIX, H, OF ORDER N (WHERE
C  N MUST BE LESS THAN 61), AND PLACES THE EIGENVALUES IN
C  THE DIAGONAL ELEMENTS OF THE MATRIX H, AND PLACES THE
C  EIGENVECTORS (NORMALIZED) IN THE COLUMNS OF THE MATRIX U.
C  IEGEN IS SET AS 1 IF ONLY EIGENVALUES ARE DESIRED, AND
C  IS SET TO 0 WHEN VECTORS ARE REQUIRED.  NR CONTAINS THE
C  NUMBER OF ROTATIONS DONE.
C

      SUBROUTINE HDIAG (H,N,IEGEN,U,NR)
      DIMENSION H(60,60),U(60,60),X(60),IQ(60)
      IF (IEGEN) 15,10,15
10    DO 14 I=1,N
      DO 14 J=1,N
      IF (I-J) 12,11,12
11    U(I,J)=1.0
      GO TO 14
12    U(I,J)=0.
14    CONTINUE
15    NR = 0
      IF (N-1) 1000,1000,17
C  SCAN FOR LARGEST OFF-DIAGONAL ELEMENT IN EACH ROW
C  X(I) CONTAINS LARGEST ELEMENT IN ITH ROW
C  IQ(I) HOLDS SECOND SUBSCRIPT DEFINING POSITION OF ELEMENT
17    NMI1=N-1
      DO 30 I=1,NMI1
      X(I) = 0.
      IPL1=I+1
      DO 30 J=IPL1,N
      IF (X(I) - ABS(H(I,J))) 20,20,30
20    X(I)=ABS(H(I,J))
      IQ(I)=J
30    CONTINUE

C  SET INDICATOR FOR SHUT-OFF.  RAP=2**-27, NR = NO. OF ROTATIONS
      RAP=7.450580596E-9
      HDTEST=1.0E38
C  FIND MAXIMUM OF X(I) S FOR PIVOT ELEMENT AND
C  TEST FOR END OF PROBLEM

40    DO 70 I=1,NMI1
      IF (I-1) 60,60,45
45    IF (XMAX - X(I)) 60,70,70
60    XMAX = X(I)
      IPIV=I
      JPIV=IQ(I)
70    CONTINUE
C  IS MAX. X(I) EQUAL TO ZERO, IF LESS THAN HDTEST, REVISE
C  HDTEST
      IF (XMAX) 1000,1000,80
80    IF (HDTEST) 90,90,85
85    IF (XMAX - HDTEST) 90,90,148
90    HDIMIN = ABS(H(1,1))
      DO 110 I=2,N
      IF (HDIMIN - ABS(H(I,I))) 110,110,100
100   HDIMIN=ABS(H(I,I))
110   CONTINUE
      HDTEST=HDIMIN*RAP
C  RETURN IF MAX.H(I,J) LESS THAN (2**-27) ABS(H(K,K)-MIN)
      IF (HDTEST - XMAX) 148,1000,1000
148   NR = NR+1

C  COMPUTE TANGENT, SINE AND COSINE, H(I,I), H(J,J)
150  0TANG = SIGN(2.0,(H(IPIV,IPIV)-H(JPIV,JPIV)))*H(IPIV,JPIV)/
     1(ABS(H(IPIV,IPIV)-H(JPIV,JPIV))+SQRT((H(IPIV,IPIV)-H(
     2JPIV,JPIV))**2+4.0*H(IPIV,JPIV)**2))
      COSINE = 1.0/SQRT(1.0+TANG**2)
      SINE = TANG*COSINE
      HII=H(IPIV,IPIV)
      H(IPIV,IPIV)=COSINE**2*(HII+TANG*(2.*H(IPIV,JPIV)+TANG*H
     1(JPIV,JPIV)))
      H(JPIV,JPIV)=COSINE**2*(H(JPIV,JPIV)-TANG*(2.*H(IPIV,JPI
     1V)-TANG*HII))
      H(IPIV,JPIV)=0.
C  PSEUDO RANK THE EIGENVALUES
C  ADJUST SINE AND COS FOR COMPUTATION OF H(IK) AND U(IK)
      IF (H(IPIV,IPIV) - H(JPIV,JPIV)) 152,153,153
152   HTEMP = H(IPIV,IPIV)
      H(IPIV,IPIV) = H(JPIV,JPIV)
      H(JPIV,JPIV) = HTEMP
C  RECOMPUTE SINE AND COS
      HTEMP = SIGN (1.0,-SINE) * COSINE
      COSINE = ABS (SINE)
      SINE = HTEMP
153   CONTINUE
C  INSPECT THE IQS BETWEEN I+1 AND N-1 TO DETERMINE
C  WHETHER A NEW MAXIMUM VALUE SHOULD BE COMPUTED SINCE
C  THE PRESENT MAXIMUM IS IN THE I OR J ROW.
      DO 350 I=1,NMI1
      IF (I-IPIV) 210,350,200
200   IF (I-JPIV) 210,350,210
210   IF (IQ(I)-IPIV) 230,240,230
230   IF (IQ(I)-JPIV) 350,240,350
240   K=IQ(I)
250   HTEMP=H(I,K)
      H(I,K)=0.
      IPL1=I+1
      X(I)=0.
C  SEARCH IN DEPLETED ROW FOR NEW MAXIMUM
      DO 320 J=IPL1,N
      IF (X(I)-ABS(H(I,J))) 300,300,320
300   X(I) = ABS(H(I,J))
      IQ(I)=J
320   CONTINUE
      H(I,K)=HTEMP
350   CONTINUE
      X(IPIV) =0.
      X(JPIV) =0.
C  CHANGE THE OTHER ELEMENTS OF H
      DO 530 I=1,N
      IF (I-IPIV) 370,530,420
370   HTEMP = H(I,IPIV)
      H(I,IPIV) = COSINE*HTEMP + SINE*H(I,JPIV)
      IF (X(I) - ABS(H(I,IPIV))) 380,390,390
380   X(I) = ABS(H(I,IPIV))
      IQ(I) = IPIV
390   H(I,JPIV) = -SINE*HTEMP + COSINE*H(I,JPIV)
      IF (X(I) - ABS(H(I,JPIV))) 400,530,530
400   X(I) = ABS(H(I,JPIV))
      IQ(I) = JPIV
      GO TO 530
420   IF (I-JPIV) 430,530,480
430   HTEMP = H(IPIV,I)
      H(IPIV,I) = COSINE*HTEMP + SINE*H(I,JPIV)
      IF (X(IPIV) - ABS(H(IPIV,I))) 440,450,450
440   X(IPIV) = ABS(H(IPIV,I))
      IQ(IPIV) = I
450   H(I,JPIV) = -SINE*HTEMP + COSINE*H(I,JPIV)
      IF (X(I) - ABS(H(I,JPIV))) 400,530,530
480   HTEMP = H(IPIV,I)
      H(IPIV,I) = COSINE*HTEMP + SINE*H(JPIV,I)
      IF (X(IPIV) - ABS(H(IPIV,I))) 490,500,500
490   X(IPIV) = ABS(H(IPIV,I))
      IQ(IPIV) = I
500   H(JPIV,I) = -SINE*HTEMP + COSINE*H(JPIV,I)
      IF (X(JPIV) - ABS(H(JPIV,I))) 510,530,530
510   X(JPIV) = ABS(H(JPIV,I))
      IQ(JPIV) = I
530   CONTINUE

C  TEST FOR COMPUTATION OF EIGENVECTORS
      IF (IEGEN) 40,540,40
540   DO 550 I=1,N
      HTEMP=U(I,IPIV)
      U(I,IPIV)=COSINE*HTEMP+SINE*U(I,JPIV)
550   U(I,JPIV)=-SINE*HTEMP+COSINE*U(I,JPIV)
      GO TO 40
1000  RETURN
      END
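The rotation scheme used by HDIAG can be illustrated with a short Python sketch. It is a simplified stand-in, not a translation: the function name `jacobi_eigen` and its parameters are hypothetical, the row-maximum bookkeeping in X and IQ is replaced by a full scan for the largest off-diagonal element, and the pseudo-ranking of eigenvalues is omitted. The tangent formula follows statement 150, and the shut-off test compares the pivot with 2**-27 times the smallest diagonal magnitude, as the comments in the listing describe.

```python
import math


def jacobi_eigen(h, tol=2.0 ** -27, max_rotations=10000):
    """Classical Jacobi diagonalization of a real symmetric matrix.

    Returns (eigenvalues, u, rotations); eigenvectors are the columns of u.
    A simplified sketch of the HDIAG idea, not the original routine.
    """
    n = len(h)
    a = [list(row) for row in h]                    # work on a copy
    u = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    rotations = 0
    while rotations < max_rotations:
        # Locate the largest off-diagonal element above the diagonal.
        p, q, biggest = 0, 0, 0.0
        for i in range(n - 1):
            for j in range(i + 1, n):
                if abs(a[i][j]) > biggest:
                    p, q, biggest = i, j, abs(a[i][j])
        smallest_diag = min(abs(a[i][i]) for i in range(n))
        if biggest == 0.0 or biggest <= tol * smallest_diag:
            break                                   # shut-off test
        # Tangent, cosine and sine of the rotation (statement 150).
        diff = a[p][p] - a[q][q]
        tang = (math.copysign(2.0, diff) * a[p][q]
                / (abs(diff) + math.sqrt(diff * diff + 4.0 * a[p][q] ** 2)))
        cosine = 1.0 / math.sqrt(1.0 + tang * tang)
        sine = tang * cosine
        for k in range(n):                          # columns p and q
            akp, akq = a[k][p], a[k][q]
            a[k][p] = cosine * akp + sine * akq
            a[k][q] = -sine * akp + cosine * akq
        for k in range(n):                          # rows p and q
            apk, aqk = a[p][k], a[q][k]
            a[p][k] = cosine * apk + sine * aqk
            a[q][k] = -sine * apk + cosine * aqk
        a[p][q] = a[q][p] = 0.0                     # pivot is annihilated
        for k in range(n):                          # accumulate eigenvectors
            ukp, ukq = u[k][p], u[k][q]
            u[k][p] = cosine * ukp + sine * ukq
            u[k][q] = -sine * ukp + cosine * ukq
        rotations += 1
    return [a[i][i] for i in range(n)], u, rotations
```

Scanning the whole upper triangle on every pass is slower than the X and IQ bookkeeping in HDIAG, but it keeps the sketch short and makes the pivot choice easy to follow.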
C  APPENDIX G
C
C  VARIMAX ROTATION PROGRAM
C  FROM COOLEY AND LOHNES
C  THIS PROGRAM WILL PERFORM VARIMAX ROTATION FOR UP TO 75
C  FACTORS AND 100 VARIABLES.
C
C  INPUT.
C  ENTER NEIG (COLS 59 - 60).
C  CARD 1 CONTAINS PROB(COL 1-48)= PROBLEM IDENTIFI-
C  CATION WHICH MAY BE ALPHABETIC, N (COL 49-51)= NO. OF
C  VARIABLES, IF N=0, CALL EXIT, L (COLS 52-53) = NO. OF
C  FACTORS, NVALUE (COLS 54-55) = NO. OF ITERATION CYCLES
C  (USE 00 IF THIS TEST IS NOT DESIRED) JTEST (COL 56-57) =
C  NO. OF CYCLES NEW VARIANCE REQUIRED TO EQUAL OLD VARI-
C  ANCE, RORC (COL 58) = 0 IF AN INPUT CARD CONTAINS THE
C  LOADINGS FOR A VARIABLE (MATRIX IS IN CARDS AS ROWS)
C  RORC = 1 IF AN INPUT CARD CONTAINS THE LOADINGS FOR A
C  FACTOR (MATRIX IS IN CARDS AS COLUMNS).
C  IF INPUT DATA IS FROM EIGENVECTORS AND NOT FACTORS,
C  NEIG = 0, OTHERWISE NEIG = 1.  IF NEIG IS 0, ENTER
C  EIGENVALUES IN THE ORDER OF RESPECTIVE EIGENVECTORS ON
C  CARD(S) WHICH FOLLOW THE ABOVE DATA.
C  LOADINGS FOLLOW CARD 1.

      DIMENSION A(50,20),V(50),TV(50),H(50),AX(50,20)
     1,HN(50),HD(50),PROB(12),EIGVAL(20),AY(50,20)
      COMMON A,V,TV,NV,N,L,FN,T,B,P


1001  FORMAT(17H0NO.     VARIANCE)
1002  FORMAT(1H ,I3,F20.8)
1003  FORMAT(40X,16H0  FACTOR MATRIX)
1004  FORMAT(9H0VARIABLE,I4/)

1005  FORMAT(  IH  lOFU 

1006  FORMAT(12A4,I3,3I2,I1,I2)
1007  FORMAT(5E14.7)
1008  FORMAT(27H0OLD H2     NEW H2 DIFFERENCE/)
1009  FORMAT(F6.3,F8.3,F12.3,I8)
1010  FORMAT((10X,6F10.6))
1012 0FORMAT(1H1,26X,37H OUTPUT FROM VARIMAX ROTATION PROGRAM/
     1/20X,12A4//)
1013 0FORMAT(21X,37H EIGENVALUES FROM PRINCIPAL COMPONENT,
     19H ANALYSIS//(14X,6F10.6))
1014 0FORMAT(1H0,14X,35H NO. ROTATIONS REQUIRED TO MAXIMIZE,
     120H VARIMAX CRITERION =,I3)
1015  FORMAT(12A6)
1016  FORMAT(1H0,33X,22H ORIGINAL EIGENVECTORS/)
1017  FORMAT(14X,7H VECTOR,I3/(14X,6F10.6))
1018  FORMAT(1H0,33X,23H ORIGINAL FACTOR MATRIX/)
1019  FORMAT(14X,7H FACTOR,I3/(14X,6F10.6))
1020  FORMAT(1H0,30X,28H FINAL ROTATED FACTOR MATRIX/)
1021  FORMAT(1H0,32X,24H VARIANCE OF EACH FACTOR//(14X,6F10.6))
1022 0FORMAT(1H0,17X,37H PERCENTAGE OF VARIANCE ACCOUNTED FOR,
     115H BY EACH FACTOR//(14X,6F10.5))
1023  FORMAT(1H0,22X,23H ACCUMULATED PERCENTAGE//(14X,6F10.4))
1024 0FORMAT(1H0,29X,30H REDUCED FINAL ROTATED FACTORS/14X,
     19H VARIABLE,21X,8H FACTORS/)
1025  FORMAT(14X,I2,3X,10F5.2)
1026  FORMAT(/10X,9HVARIANCES,10F5.1)
1027  FORMAT((14X,6F10.6))

24    READ (105,1006)(PROB(I),I=1,12),N,L,NVALUE,JTEST,NRORC,NEIG
      RORC = NRORC
      IF(N) 6000,6000,25
25    WRITE (108,1012) PROB
      WRITE (106,1012) PROB
      IF (RORC) 26,26,28
26    DO 27 I = 1,N
27    READ(105,1010)(A(I,J),J=1,L)
      GO TO 29
28    DO 20 J=1,L
20    READ(105,1027)(A(I,J),I=1,N)
29    IF(NEIG) 31,31,30
31    READ(105,1010)(EIGVAL(J),J=1,L)
      WRITE(108,1013)(EIGVAL(J),J=1,L)
      WRITE(106,1013)(EIGVAL(J),J=1,L)
      DO 32 J=1,L
32    EIGVAL(J)=SQRT(EIGVAL(J))
      WRITE(108,1016)
      WRITE(106,1016)
      DO 34 J=1,L
      WRITE(106,1017) J,(A(I,J),I=1,N)
34    WRITE(108,1017) J,(A(I,J),I=1,N)
      DO 33 J=1,L
      DO 33 I=1,N
33    A(I,J)=EIGVAL(J)*A(I,J)
      WRITE(108,1018)
      WRITE(106,1018)
      DO 35 J=1,L
      WRITE(106,1019) J,(A(I,J),I=1,N)
35    WRITE(108,1019) J,(A(I,J),I=1,N)
30    EPS=0.00116
      NC = 0
      TV(1)=0.0
      LL=L-1
      NV=1
      FN=N
      DO 3 I=1,N
3     H(I)=0.0
      DO 4 I=1,N
      DO 4 J=1,L
      H(I)=H(I)+A(I,J)*A(I,J)
4     CONTINUE
      DO 5 I=1,N
      H(I)=SQRT(H(I))
      DO 5 J=1,L
5     A(I,J)=A(I,J)/H(I)
222   CALL VARMAX
      DO 6 I=1,N
      DO 6 J=1,L
6     AX(I,J)=A(I,J)*H(I)
      LV=NV-1

■  3  IF'(LV-f.VMUE) 

10    IF(JTEST) 13,13,16
16    IF(ABS(TV(NV)-TV(LV))-0.0000001) 11,11,13
11    NC=NC+1
12    IF(NC-JTEST) 13,999,999

13    DO 500 J=1,LL
      II=J+1
      DO 500 K=II,L
      AA=0.0
      BB=0.0
      CC=0.0
      DD=0.0
      DO 15 I=1,N
      XX=A(I,J)
      YY=A(I,K)
      U=(XX+YY)*(XX-YY)
      W = 2.0*XX*YY
      CC=CC+(U+W)*(U-W)
      DD=DD+2.0*U*W
      AA=AA+U
      BB=BB+W
15    CONTINUE
      T=DD-2.0*AA*BB/FN
      B=CC-(AA**2-BB**2)/FN
      P=0.25*ATANF(T/B)
      TAN4T=T/B
      IF(T-B) 1041,1433,1042
1433  IF((T+B)-EPS) 500,1043,1043
1043  COS4T=CONS
      SIN4T=CONS
      GO TO 5000
1041  TAN4T=ABS(T)/ABS(B)
      IF(TAN4T-EPS) 8000,1100,1100
1100  COS4T=1.0/SQRT(1.0+TAN4T**2)
      SIN4T=TAN4T*COS4T
      GO TO 5000
8000  IF(B) 1150,500,500
1150  SINP=CONS
      COSP=CONS
      GO TO 1000
1042  CTN4T=ABS(T)/ABS(B)
      IF(CTN4T-EPS) 9000,1200,1200
1200  SIN4T=1.0/SQRT(1.0+CTN4T**2)
      COS4T=CTN4T*SIN4T
      GO TO 5000
9000  COS4T=0.0
      SIN4T=1.0
5000  COS2T=SQRT((1.0+COS4T)/2.0)
      SIN2T=SIN4T/(2.0*COS2T)
      COST=SQRT((1.0+COS2T)/2.0)
      SINT=SIN2T/(2.0*COST)
      IF(B) 1250,1250,1300
1300  COSP=COST
      SINP=SINT
      GO TO 7000
1250  COSP=CONS*COST+CONS*SINT
      SINP=ABS(CONS*COST-CONS*SINT)
7000  IF(T) 1400,1400,1000
1400  SINP=-SINP
1000  X=COSP
      Y=SINP
      DO 100 I=1,N
      AIJ=A(I,J)*X+A(I,K)*Y
      AIK=-A(I,J)*Y+A(I,K)*X
      A(I,J)=AIJ
100   A(I,K)=AIK
500   CONTINUE
      GO TO 222
999   DO 301 I=1,N
301   HN(I)=0.0
      DO 303 I=1,N
      H(I)=H(I)*H(I)
      DO 302 J=1,L
302   HN(I)=HN(I)+AX(I,J)*AX(I,J)
303   HD(I)=HN(I)-H(I)
      WRITE(108,1014) NV
      WRITE(108,1020)
      WRITE(106,1020)
      DO 305 J=1,L
      H(J)=0.0
      WRITE(108,1019) J,(AX(I,J),I=1,N)
      WRITE(106,1019) J,(AX(I,J),I=1,N)
      DO 304 I=1,N
      AY(I,J)=AX(I,J)**2
304   H(J)=H(J)+AY(I,J)
      HN(J)=(H(J)*100.)/XN
305   CONTINUE
      WRITE(108,1021)(H(J),J=1,L)
      WRITE(106,1021)(H(J),J=1,L)
      WRITE(108,1022)(HN(J),J=1,L)
      WRITE(106,1022)(HN(J),J=1,L)
      DO 306 J=2,L
306   HN(J)=HN(J)+HN(J-1)
      WRITE(108,1023)(HN(J),J=1,L)
      WRITE(106,1023)(HN(J),J=1,L)
      WRITE(108,1020)
      WRITE(106,1020)
      WRITE(108,1024)
      WRITE(106,1024)
      DO 307 I=1,N
      WRITE(106,1025) I,(AX(I,J),J=1,L)
307   WRITE(108,1025) I,(AX(I,J),J=1,L)
      WRITE(108,1026)(H(J),J=1,L)
      WRITE(106,1026)(H(J),J=1,L)
      GO TO 24
6000  CALL EXIT
      SUBROUTINE VARMAX
      DIMENSION A(50,20),SA(20),SA2(20),V(50),TV(50)
      COMMON A,V,TV,NV,N,L,FN,T,B,P
      NV = NV+1
      DO 6 J=1,L
      SA(J)=0.0
      SA2(J)=0.0
6     CONTINUE
      DO 8 J=1,L
      DO 7 I=1,N
      SA(J)=SA(J)+A(I,J)*A(I,J)
7     SA2(J)=SA2(J)+(A(I,J)*A(I,J))**2
      SA(J)=SA(J)**2
8     V(J)=(FN*SA2(J)-SA(J))/FN**2
      SV=0.0
      DO 9 J=1,L
9     SV=SV+V(J)
      TV(NV)=SV
      RETURN
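The quantity that the subroutine stores in TV(NV) is the varimax criterion, which can be sketched in Python as follows. The function name `varimax_criterion` and the list-of-rows layout are assumptions made for this illustration. For each factor column the sketch forms the variance of the squared loadings, (n * sum of fourth powers - (sum of squares)**2) / n**2, as in statement 8, and then sums these over the factors.

```python
def varimax_criterion(loadings):
    """Return (per-factor variances, total criterion) for a loading matrix.

    loadings is a list of rows, one row per variable and one column per
    factor; this layout is an assumption made for the sketch.
    """
    n = len(loadings)                  # number of variables (FN)
    n_factors = len(loadings[0])       # number of factors (L)
    column_variances = []
    for j in range(n_factors):
        sum_sq = sum(row[j] ** 2 for row in loadings)       # SA(J)
        sum_fourth = sum(row[j] ** 4 for row in loadings)   # SA2(J)
        column_variances.append((n * sum_fourth - sum_sq ** 2) / n ** 2)
    return column_variances, sum(column_variances)
```

A loading matrix in which each variable loads on exactly one factor gives a large criterion, while a matrix whose squared loadings are equal within every column gives zero. The main program stops rotating once this total stops changing between cycles.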
APPENDIX H


MEANS AND STANDARD DEVIATIONS OF VARIABLES

VARIABLE           RAW DATA                  TRANSFORMED DATA

               MEAN        STD. DEV.        MEAN        STD. DEV.

A            21.8 6298     16.251465      1.254370      0.251140

on  H 

^.9oiV33 

r\      C  /,  D  '1  ~7  ~7 

u.64o2  /  / 

0.08o734 

AZ          205.619995     57.481766      2.295727      0.125083
ELEV       3496.000000    619.873047      3.536823      0.077241
GNDS          0.044714      0.025749     -1.434831      0.285002
GNDL          0.669614      0.240575     -0.209380      0.187961
FREQ         12.685501      6.949473      1.033136      0.256508
L            12.912598      5.338099      1.072730      0.178872
S             0.007940      0.004395     -2.135564      0.152279
USE          13.959315     20.947428      0.764478      0.384418

I  In  F  K 

0.096311 

C.OOr'343 

-1.016577 

0*015277 

on  M  n 

r\    r\  1  Q  P  "7  /■ 

r\     .--i  n  Q  D  O  ^, 

—  ii.(Ji/i-5( 

I • Q^y  5  DD 

T 
I 

n  n  R  w  n  1  9 

n    ri  A  A  n  o  p 
u  .  U  O  H  'J  V  o 

1  .^'+  /  joU 

T  c  ri 